VEXIERVERSUCH: THE LOG RELATIONSHIP BETWEEN
WORD-FREQUENCY AND RECOGNITION
OBTAINED IN THE ABSENCE OF
STIMULUS WORDS¹


ISRAEL GOLDIAMOND AND WILLIAM F. HAWKINS²
Southern Illinois University 


The relationship between word- 
frequency and recognition has recently 
received considerable experimental 
attention. Words more frequently 
used in the language have produced 
higher intelligibility (articulation) 
scores than those less frequently used 
(2), and similar relations hold for 
other studies of recognition of spoken 
words (9, 12), as well as for visual 
recognition of words flashed on a 
screen (e.g., 10, 16). The extent 
to which the frequency-recognition 
relationship accounts for data from 
both modalities is the basis for con- 
siderable controversy in perceptual 
defense studies (8). This controversy 
relates to the more basic question of 
whether the data obtained are to 
be given a perceptual interpretation 
or a response interpretation. Stated 
otherwise, does frequency affect seeing 
or saying (cf. 13)? This argument 
may have relevance for the validity
of sensory inferences drawn from
articulation and intelligibility scores,
as well as from other recognition
data.

1 This research was made possible through
grants from the National Science Foundation,
to whom the author is greatly indebted for
generous support.

2 Now at the University of Virginia.

The frequency with which a word 
appears in the language can only be 
estimated; word counts from selected 
publications (6, 18) provide samples 
which may not be representative of 
all Ss (4, 5). In the investigation 
of the frequency-recognition relation- 
ship, nonsense syllables present an 
advantage in that their frequency is 
more amenable to experimental con- 
trol, since they have less of a history 
of prior usage than other words. In 
a training session, for example, one 
syllable may be presented 25 times, 
another 10 times, and so on. Each 
syllable is then presented in a recogni- 
tion session. Such experiments re- 
port frequency-recognition relation- 
ships of the type discussed (1, 7, 11, 
14, 16). 

This study reports such a nonsense 
syllable experiment. Following a 
training session of the type described, 
S was told that one of the training 
words would be flashed subliminally
at regular intervals on the screen.
He was to respond with a word 
whether he saw one or not; when he 
was correct, E would so inform him 
and would initiate a new series. This 
experiment, however, differs in that 
all flashes were blanks. If the previ- 
ously reported frequency-recognition 
relationship is now obtained, this 
relationship cannot be ascribed to 
perception. 

Recognition involves accuracy, and 
accuracy is usually regarded as the 
congruence between S’s responses and 
the stimulus sequence. Hence the 
perceptual inference. In actuality, 
however, E seldom co-monitors the 
signals with S; accuracy is usually 
“the congruence between S’s response
and E’s score sheet” (8, p. 35).

If (a) words more frequently used 
in a training session (where the rela- 
tion between reinforcement and use 
is constant) are also the more fre-
quent responses in a recognition
session, and if the training session
words appear equally on E’s score
sheet, then (b) the frequent words
will score more hits than the infre- 
quent words. The frequent words 
should also produce such congruences 
earlier in the series. If, as in the 
ascending Method of Limits, a hit 
terminates the series, and ascending 
stimulus energy values are assigned 
to successive presentations in the 
series, then (c) recognition thresholds 
of more frequent words should be 
lower than those for less frequent 
words. 
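
The chain from (a) to (c) can be illustrated with a small simulation in which no stimulus is ever shown. This is only a sketch under stated assumptions. The response weights (1 + ln f), the function name `simulate_blank_series`, and the number of simulated Ss are hypothetical and are not taken from the experiment. Only the five training frequencies, the two words per frequency, the 25-presentation limit, and the value of 26 assigned to unrecognized words follow the text. Extraneous words and neologisms are left out for simplicity.

```python
import math
import random

FREQS = [25, 10, 5, 2, 1]      # training frequencies used in the study
MAX_PRESENTATIONS = 25         # a series is abandoned after Presentation 25
UNRECOGNIZED_VALUE = 26        # value assigned when no hit occurs


def simulate_blank_series(n_subjects=2000, seed=0):
    """Simulate guessing against E's score sheet with blank flashes only."""
    rng = random.Random(seed)
    # Two placeholder words per training frequency: (frequency, index).
    words = [(f, k) for f in FREQS for k in range(2)]
    # ASSUMPTION: response bias grows with the log of training frequency.
    weights = [1.0 + math.log(f) for f, _ in words]

    hits = {f: 0 for f in FREQS}
    threshold_sum = {f: 0 for f in FREQS}
    for _ in range(n_subjects):
        for target in words:               # each word heads one column
            score = UNRECOGNIZED_VALUE
            for presentation in range(1, MAX_PRESENTATIONS + 1):
                response = rng.choices(words, weights=weights, k=1)[0]
                if response == target:     # congruence with the score sheet
                    hits[target[0]] += 1
                    score = presentation
                    break
            threshold_sum[target[0]] += score

    n_series = 2 * n_subjects              # series per training frequency
    percent_hit = {f: 100.0 * hits[f] / n_series for f in FREQS}
    mean_threshold = {f: threshold_sum[f] / n_series for f in FREQS}
    return percent_hit, mean_threshold


percent_hit, mean_threshold = simulate_blank_series()
for f in FREQS:
    print(f, round(percent_hit[f], 1), round(mean_threshold[f], 2))
```

Under these assumptions the more frequently trained words collect more hits and reach them earlier, although the simulated S never receives a signal. The exact figures depend entirely on the assumed weights.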

This study has its origins in the 
Vexierversuche of classical psycho- 
physics, where blanks were introduced 
as a check on S’s biases (3, p. 479). 
The training session in the current 
experiment would be considered as 
one of many possible ways to bias 
S’s responses, that is, to give re-
sponses unequal likelihood of appear-
ing in the recognition response sample.
Results consistent with the three 
predictions made have appeared in 
the recognition literature. In this 
experiment, the predictions are made 
without reference to a signal, and 
since none is presented, perception 
is precluded from the interpretation. 
The second and third predictions 
relate E’s scoring procedures and score 
interpretations, respectively, to the 
response bias of the first prediction, 
a function of training procedures 
familiar to students of learning. 


METHOD 


Subjects.—The Ss were 25 male college stu-
dents, volunteers from a sophomore lecture
section of a physical education course. 

Stimulus words.—Stimulus words were non- 
sense syllables of low association value drawn 
from a list compiled by Hilgard (see 17). Ten 
words were divided into five pairs, and sys- 
tematically varied so that each pair would be 
presented at each of five frequencies; 13 other 
words, introduced to lengthen the training series, 
were presented once and did not appear in the 
recognition series. The paired words were: 
MIV, WUX; TUD, ZOF; HIF, MAF; FEP,
HAJ; VUK, VOF. The extraneous words 
were: BEJ, DAJ, FEH, JEX, KEJ, LIJ, PUY, 
TEV, TOV, VAB, VEF, ZID, ZIM. The fre-
quencies used were 25, 10, 5, 2, and 1. This resulted
in 99 cards per S.
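
The deck size follows from the design just described, as the short check below shows. The variable names are illustrative only.

```python
# Training frequencies and word counts as stated in the Method section.
FREQUENCIES = [25, 10, 5, 2, 1]
WORDS_PER_FREQUENCY = 2        # ten recognition words in five pairs
EXTRANEOUS_WORDS = 13          # each presented once

recognition_cards = WORDS_PER_FREQUENCY * sum(FREQUENCIES)
deck_size = recognition_cards + EXTRANEOUS_WORDS
print(recognition_cards, deck_size)   # 86 99
```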

Training procedure.—Each of the words was 
typewritten in }-in. capital letters in the center 
of a 3 × 5-in. white card, with the 25-frequency
words appearing on 25 cards, and so on. The 
deck was shuffled separately for each S and was 
placed face down between E and S. Every 8 
sec., E exposed a card in view of S, who then 
read the letters and the name, thus: “F-E-P, 
fep.” The Ss were told that these were words 
of a foreign language, with E interested in their 
pronouncing each word as it appeared. These 
responses will be called training responses. 

Interpolated activity.—Following the training 
period, S was given the January, 1958 issue of 
Consumer's Report, opened to an article on 
subliminal advertising, a half-page clipping 
from the Southern Illinoisan, a local daily, 
containing an interview with E on this subject,
plus a smaller clipping from a news weekly, 
and was requested to read these. After 10 
min., he was transferred to the perception booth.

Perception procedure.—The booth has con-
trolled and monitored lighting, and S sits 10 ft.
from a uniformly illuminated screen, in the 
center of which, at eye level, is an 11 × 11-in.
opal glass screen for rear projection. Instead
of white light, a gray mottled section from
Rorschach Plate I was flashed at each presenta-
tion.³ Interruption of the light was through a
Wollensak solenoid-operated shutter connected 
to Hunter interval timers. Communication 
with E was through a high-fidelity two-way 
system. 

The S was told that 2 sec. after the warning 
bell, one of the foreign words he had pronounced 
would be flashed into the window, so quickly 
that he might see only a flash; whether he saw 
a word or not, he was to respond with one, since 
the purpose of the experiment was to relate 
behavior to unseen or barely seen stimuli. 
Within three presentations, Ss were responding 
with a word occasioned by the window lighting 
up. That a presentation was made was discern- 
ible; duration was .02 sec., with interval being 
7 sec. All temporal relations and bell were 
automatic. 

Score sheet and psychophysical method.—The 
method mimicked was the ascending Method 
of Limits; S was told that the same word would 
be presented until he got it right, that he would 
be informed when correct, and that another 
word (or the same word, by random selection) 
would then be shown. This procedure was used 
to avert perseverations; the data obtained can 
be analyzed as though a random presentation 
method were being mimicked. The E score 
sheet had 10 columns, each headed by a word. 
These were balanced so that each word would 
appear equally as the first or second, the third 
or fourth, and so on in the 10 series. When S 
said the word at the top of the column, E stated: 
“That was the correct word. We shall now try 
another series,”’ and began to make entries in the 
next column. All words given were recorded 
in the appropriate column. A pilot study hav- 
ing produced no “correct” responses beyond 
Presentation 22, if the word at the head of the 
column were not given by Presentation 25, E 
said: “We shall now try another series,” and 
began to make entries in the next column. 

It will be noted that by this procedure, all 
Ss did not give an equal number of responses, 
nor was there the same number of entries in 
each column, since giving the word at the head
of the column terminated entries for that
column, as in the studies using the ascending
Method of Limits. In these studies, all entries
preceding the correct one are called “prerecogni-
tion hypotheses.” Responses given in the
booth will be called perceptual responses.

3 This was used because it was available, and
produced a random-appearing gray; the basic
motivation being that “something” should be
presented. This precaution turned out to be
unnecessary; Ss in a later experiment reacted
as well to pure white light.

Three Ss failed to produce a single correct 
response, and a fourth produced only one. 
Their records contained runs of the same syl- 
lables. Since the same learning procedure was 
used for all Ss who vary in learning in both 
training procedures and instructional periods, 
the performance of these Ss was considered 
indicative of poor learning procedures, and they 
were replaced by four other Ss prior to the 
analysis of the data. 


RESULTS 


Perceptual-response frequency as a
function of training-response frequency.—
This relationship is presented in Fig.
1; the logarithmic relationship be-
tween the two frequencies usually
found in recognition experiments of
this type has been obtained here.
The degree of association between
obtained order and predicted order,
as measured by Kendall’s Tau, is 1.00,
which has a significance value of
P < .01 for five ranks. For the
training frequencies of the experi-
ment, the average perceptual response
frequency is: 25, 22.18; 10, 12.30; 5,
9.88; 2, 2.38; 1, .83 (two perceptual
words, 1.04; 13 extraneous words, .80).

Fig. 1. Mean frequency of perceptual responses as a function of
training frequency. If treated as ascending Method of Limits, mean
frequency of “prerecognition hypotheses” as a function of training
frequency. If treated as random series, mean “recognizability”
(“intelligibility”) as a function of training frequency. [Ordinate:
average response frequency (mean “intelligibility”); abscissa: log
training frequency, marked at 1, 2, 5, 10, 25.]
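
The reported Tau of 1.00 and its significance for five ranks can be checked by enumerating all orderings. The sketch below uses the five means given in the text. Treating the reported P as a one-tailed exact permutation probability is an assumption, since the text does not say how the value was obtained, and the function name is illustrative.

```python
from itertools import permutations

training = [25, 10, 5, 2, 1]
mean_responses = [22.18, 12.30, 9.88, 2.38, 0.83]


def kendall_tau(x, y):
    """Kendall's tau for untied data: (concordant - discordant) / pairs."""
    n = len(x)
    s = 0
    for i in range(n):
        for j in range(i + 1, n):
            s += 1 if (x[i] - x[j]) * (y[i] - y[j]) > 0 else -1
    return s / (n * (n - 1) / 2)


tau = kendall_tau(training, mean_responses)
# Exact one-tailed P: share of the 5! orderings with tau at least as large.
orderings = list(permutations(mean_responses))
n_extreme = sum(kendall_tau(training, list(o)) >= tau for o in orderings)
p_one_tailed = n_extreme / len(orderings)
print(tau, p_one_tailed)
```

Only one of the 120 possible orderings reproduces the predicted order, so the one-tailed probability is 1/120, which is below .01.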

These results confirm the prediction 
made and suggest that had a random 
method been mimicked, instead of 
the ascending Method of Limits, S’s 
congruences would have tended to 
follow the distribution in Fig. 1, 
and “intelligibility” and “recogni- 
tion” scores would have followed suit 
as an artifact of this relationship. 
If the data are interpreted as relating 
to the ascending method, Fig. 1 can 
be considered as depicting “prerecog- 
nition hypotheses.” 
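
How closely the Fig. 1 means follow a straight line in log frequency can be gauged with an ordinary least-squares fit. The fit is an illustration added here, not an analysis reported in the study, and the variable names are arbitrary.

```python
import math

training = [25, 10, 5, 2, 1]
mean_responses = [22.18, 12.30, 9.88, 2.38, 0.83]

x = [math.log10(f) for f in training]
n = len(x)
mean_x = sum(x) / n
mean_y = sum(mean_responses) / n
sxx = sum((xi - mean_x) ** 2 for xi in x)
syy = sum((yi - mean_y) ** 2 for yi in mean_responses)
sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, mean_responses))

slope = sxy / sxx                   # responses gained per tenfold increase
intercept = mean_y - slope * mean_x
r = sxy / math.sqrt(sxx * syy)      # linear correlation with log frequency
print(round(slope, 2), round(intercept, 2), round(r, 3))
```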

Number of congruences.—When S
gave the word at the head of a column,
that series was terminated, as in the
ascending Method of Limits recogni-
tion studies, where such congruence
defines recognition. The relationship
between the percentage of words
thus recognized and training fre-
quency is indicated in Fig. 2, in which
the logarithmic relationship found
in Fig. 1 is continued.

Fig. 2. Percentage of “presented” syllables “recognized” as a
function of training frequency. Data treated as ascending Method of
Limits. [Ordinate: percent “recognized”; abscissa: log training
frequency, marked at 1, 2, 5, 10, 25.]


TABLE 1

NUMBER OF RECOGNITIONS FOR EACH FREQUENCY AS A FUNCTION OF
NUMBER OF RECOGNITIONS PER S

Words                          Word-Frequency
Recognized     Ss        25     10      5      2      1

    7           5        10     10      8      4      3
    6           5         8     10      7      4      1
    5           4         7      8      3      2      0
    4           5         8      6      6      0      0
    3           6         9      3      5      0      1

Totals         25        42     37     29     10      5
Number unrecognized       8     13     21     40     45
Mean presentation at
which recognized word
was recognized         4.50   6.78   [remaining entries illegible; last legible value 10.80]


No S recognized all words. Table 
1 relates the total number of words 
recognized by each S to the number 
of words recognized by S for each 
frequency. If S recognized all 10 
words, the recognitions would be 
evenly distributed among all five 
frequencies. If he recognized only 
one, this should tend to be a 25-train- 
ing frequency word, with a curve in 
between gradually moving away from 
the 25-frequency words. The table 
indicates that recognitions generally 
distributed themselves in accordance 
with this expectation. 
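
The internal arithmetic of Table 1 can be verified in a few lines. In the sketch below the row for Ss recognizing six words uses 4 and 1 for the two lowest frequencies; these are the values implied by the printed row and column totals. The percentages are those plotted in Fig. 2 only on the assumption that each frequency had 50 series (25 Ss, two words each).

```python
FREQS = [25, 10, 5, 2, 1]
# (words recognized per S, number of Ss, recognitions at each frequency)
rows = [
    (7, 5, [10, 10, 8, 4, 3]),
    (6, 5, [8, 10, 7, 4, 1]),   # last two entries inferred from the totals
    (5, 4, [7, 8, 3, 2, 0]),
    (4, 5, [8, 6, 6, 0, 0]),
    (3, 6, [9, 3, 5, 0, 1]),
]

n_subjects = sum(n for _, n, _ in rows)
totals = [sum(counts[i] for _, _, counts in rows) for i in range(len(FREQS))]
# Each row must hold (words per S) x (number of Ss) recognitions.
rows_consistent = all(words * n == sum(counts) for words, n, counts in rows)
series_per_freq = 2 * n_subjects            # two words per frequency per S
unrecognized = [series_per_freq - t for t in totals]
percent = [100 * t / series_per_freq for t in totals]
print(n_subjects, totals, unrecognized, percent, rows_consistent)
```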

Average threshold.—With regard to 
the relationship between thresholds 
and frequency, to what extent are 
the results of this experiment similar 
to results obtained where signals are 
actually presented? Assuming that 
signals were presented here would 
require that stimulus energy be in- 
creased with each presentation by 
some physical value we shall call an 
erg. It will be noted from Table
1 that many words required energy
levels higher than 25 ergs for recogni-
tion; they were not recognized by
Presentation 25. If we assume that
all words, regardless of frequency,
require an average of 26 ergs for
recognition, we can compute an
average threshold for each word,
based upon number of words un-
recognized at Presentation 25 (given
values of 26), plus number of words
unrecognized at preceding presenta-
tions, but previously congruent
through the interaction of positive
response bias with score sheet entries
(given values of the presentation of
congruence). Such average thresh-
olds are presented in Fig. 3. The
ordinate is expressed in terms of
relative stimulus energy; y can stand
for ergs, milliseconds, millivolts, and
the like. The inverse logarithmic
relation between stimulus energy and
frequency reported in the literature,
and often considered indicative of
perceptual effects, has been obtained.

Fig. 3. “Threshold energy” required to “recognize” words as a
function of training frequency. Data treated as ascending Method of
Limits. [Ordinate: AML “recognition” threshold (“recognition speed”);
abscissa: log training frequency, marked at 1, 2, 5, 10, 25.]
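
One reading of this computation is a weighted mean: unrecognized words count as 26, and recognized words count as the presentation on which they were hit. The sketch below applies that reading to the two frequencies whose mean recognition presentations are legible in Table 1 (4.50 and 6.78). The function name and the assumption of 50 series per frequency are illustrative, and whether Fig. 3 plots exactly these values cannot be confirmed from the text.

```python
SERIES_PER_FREQUENCY = 50      # 25 Ss x 2 words at each training frequency
UNRECOGNIZED_VALUE = 26        # assumed value for words not hit by Presentation 25


def average_threshold(n_recognized, mean_presentation_when_recognized):
    """Weighted mean of 26 (unrecognized) and the mean hit presentation."""
    n_unrecognized = SERIES_PER_FREQUENCY - n_recognized
    total = (n_unrecognized * UNRECOGNIZED_VALUE
             + n_recognized * mean_presentation_when_recognized)
    return total / SERIES_PER_FREQUENCY


threshold_25 = average_threshold(42, 4.50)
threshold_10 = average_threshold(37, 6.78)
print(round(threshold_25, 2), round(threshold_10, 2))   # 7.94 11.78
```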

Other results.—The syllables were
selected from a list of words of equal 
association value; that their values 
were not equal for the population 
in this study is suggested by marked 
differences in the frequency of per- 
ceptual responses given. Each of the 
recognition words was presented an 
average of 8.3 times in the training 
session; the number of perceptual 
responses for each was: TUD, 394; 
HAJ, 382; MIV, 294; WUX, 270; 
VOF, 230; FEP, 200; MAF, 192; 
HIF, 153; VUK, 143; ZOJ, 127. The 
13 nonrecognition words were pre- 
sented once in the training session; 
their perceptual response frequencies 
range from TEV, 103; PUY, 42; to
KEJ, ZUM, LIJ, each 0.

A total of 1259 neologisms were given 
by all Ss, constituting .31 of all responses. 
All but 156 of these shared two letters 
with the training words. Considera-
tion of these as partial responses
(or partial “recognitions”) did not
alter the logarithmic functions when 
added to the data. No consistency 
could be found in their use. Words 
scoring high on partial response 
scored both high and low on full 
response; similar results obtained for 
other scores. The analysis was com-
plicated by duplications (for example,
T-V in TEV and TOV), for which
arbitrary corrections were made.


DISCUSSION


The results of this study can be 
interpreted as challenging a perceptual 
interpretation of the relationship between 
word-frequency and recognition-intelligi- 
bility, where word-frequency can be 
placed under laboratory control. Per- 
ception was not involved in this study, 
yet the logarithmic recognition-frequency 
curves were obtained. If we pretend,
as in Fig. 3, that syllables were flashed
on the screen, the data make sense when
we assume that all syllables are equally
perceptible. Stated otherwise, we as-
sume that frequency as a variable does
not affect perceptibility. If, continuing 
with the signal pretense, we interpret 
the data to indicate that frequency does 
affect perception—that is, the results 
indicate differences in perceptibility— 
we find ourselves in the position of 
stating that the same data can be ade- 
quately explained by having frequency 
both affect perception and not affect it. 
On the other hand, stating that fre- 
quency does not affect perception, but 
does affect response bias, eliminates 
the contradiction as well as explains 
the data. The lowered stimulus energy 
at which recognition of high-frequency 
words occurs has been interpreted to 
mean that “the more frequently a word 
is used . . . the more readily it can be 
seen” (15). This interpretation is open 
to question since the inverse energy- 
frequency relation from which it is drawn 
may simply be an artifact of coupling an 
ascending energy series to sequential 
progression of a series. 

The results of this study may also 
be relevant for interpretation of articu- 
lation tests. Using S as referent, con- 
gruence between responses and E’s 
score sheet is used to define sensitivity; 
using the word as referent, such congru- 
ence is used to define intelligibility. In 
the one case, inferences are often made 
about the person; in the other, about 
the perceptibility of the stimulus. 

Both identification and recognition 
involve an accuracy indicator which 
may be heavily influenced by congru- 
ences related to response biases and 
irrelevant to the perceptual issues under 
study. Although this experiment deals 
with nonsense syllables, the frequency 
of which is experimentally varied, the 
author would consider the data as sug- 
gesting, at least, that similar considera- 
tions may hold for studies relating other 
types of frequency to recognition thresh- 
olds; implications of the consideration 
of the role of response bias are drawn for 
other experiments as well in a recent 
methodological analysis of perceptual 
experiments (8). Most recently, Howes
(9) has interpreted parallels between
auditory and visual recognition data
as relating to response variables, and 
has assigned 69% of the variance in 
intelligibility data to word-frequency. 


SUMMARY 


An experiment was conducted to ascertain 
the extent to which previously reported relations 
between word-frequency and recognition thresh- 
olds could be obtained in the absence of a per- 
ceptual stimulus, and could therefore be ex- 
plained without resort to perception. Twenty- 
five Ss were first given a training session in 
which they repeated nonsense syllables at 
different frequencies. They were then told, 
in a recognition session, that these words would 
be flashed subliminally; they were to guess until 
accurate. Accuracy was defined as responding 
with the word on E’s score sheet. Blanks were
presented throughout. 

A logarithmic relation was found between 
frequency of training and frequency of response 
in the recognition session. Since the training 
words appeared equally on E’s score sheet,
the more frequent responses led to more hits, 
this accuracy-frequency relationship also being 
logarithmic. Since an ascending Method of 
Limits was mimicked, with accuracy terminating 
a series, the more frequent words also scored 
hits earlier, producing lower thresholds. 

The similarity between these results, which 
cannot be ascribed to perception, and the data 
obtained from similar experiments where a 
stimulus has been presented, was interpreted 
as challenging a perceptual interpretation of the 
word-frequency-recognition relationship, where 
similar procedures are utilized. Implications of 
these results for recognition, identification, 
intelligibility, and articulation studies were 
discussed, with the suggestion offered that the 
existence of a contribution to the variance from 
response bias necessitates caution in inferring 
sensitivity or intelligibility (perceptibility) 
from such data.


THE INTERSTIMULUS INTERVAL AND THE LATENCY
OF THE CONDITIONED EYELID RESPONSE¹


C. ALAN BONEAU 
Duke University 


The effect upon eyelid conditioning 
of the time interval between the 
CS and the UCS has been studied in 
detail, with a rather precise specifica- 
tion of the shape of the function 
emerging from complementary stud- 
ies by Reynolds (12) and Kimble (6). 
Beyond this, recent studies, using the 
technique of changing from one inter- 
stimulus interval to another during 
conditioning, have produced results 
which show that the level of condi- 
tioning can be altered by such manipu- 
lation of the interval. McAllister 
(10), for example, began conditioning 
with an optimal interval and then 
shifted to interstimulus intervals at 
which no conditioning usually occurs. 
For a group shifted from a .5-sec. 
to a 2.5-sec. interstimulus interval, 
the result was a gradual extinction 
of the CR, a marked contrast to the 
abrupt decline exhibited by a group 
extinguished in the usual manner 
with CS alone. Boneau, King, and 
Kimble (2) have reported a similar 
experiment in which the shift was to 
an interval permitting acquisition 
of a CR. The results of this experi- 
ment were similar to those of Mc- 
Allister. There was also some evi- 
dence that responses appropriate to 
the different interstimulus intervals 
behave independently of one another 
after the shift. 

1 Paper based upon a dissertation submitted 
to Duke University in partial fulfillment of the 
requirements for the degree of Doctor of Philos- 
ophy. Research was carried out while the 
author was research assistant on Grant NSF-G- 
2147. The author is indebted to Gregory A. 
Kimble, as principal investigator, for his part 
in directing the project and for his help in 
preparing the report. 




All of these studies have been 
primarily concerned with the amount 
of conditioning defined in terms of 
percentage of CR’s in successive 
blocks of trials. The possibility that 
the interstimulus interval influences 
other measures such as latency has 
largely been ignored. The experi- 
ment reported here studied the effects 
of shifting from an optimal inter- 
stimulus interval to two different non- 
optimal ones and made a detailed 
analysis of the latency of CR’s 
associated with different interstimulus 
intervals. 

METHOD 


Apparatus.—The eyelid-conditioning appa- 
ratus previously employed at Duke University 
(8) was used in this experiment. This apparatus 
photoelectrically responded to the movement 
of the eyelid. A small aluminum-foil eyelash 
attached to S’s right eyelid interrupted a beam 
of light directed at a photocell. Movement of 
the eyelash out of the beam (approximately 
1/8 in. downward excursion at a distance of 
2 in. from the eyelid) was recorded as a blink. 
The S sat in a sound-deadened, lighted cubicle 
with his head resting in a chin-and-head rest. 
Modifications of the apparatus for the present 
experiment included the addition of an Esterline- 
Angus Operations Recorder which recorded 
onset and termination of both the CS and UCS 
as well as the occurrence and termination of 
blinks. In addition to this graphical record, 
a Standard Electric .01-sec. timer was utilized
to get direct measurements of the latency of the 
blink recorded from time of onset of CS. 

The CS was a small light } in. in diameter at 
a distance of 20 in. from S. It remained on 
until the termination of the UCS. The UCS 
was an airpuff, nominally .05 sec. in duration at 
a pressure equivalent to 50 mm. of mercury. 
It was released by a solenoid valve and delivered
from a distance of approximately $ in. to the 
corner of S’s right eye through a nozzle with a 
3/64-in. aperture. Intertrial intervals, pro- 
grammed by a motor-operated disc, repeated the 
sequence 12, 10, 16, and 22 sec.

Subjects.—The Ss were undergraduate stu-
dents at Duke University participating to fulfill
a course requirement. Of the 87 Ss, five were 
rejected because of a high blink rate on a pretest 
(see Procedure), five were classified as voluntary 
responders on the basis of the criterion proposed 
by Spence and Taylor (14) and were discarded. 
Thirteen others were not used because they 
failed to give UCR’s on five or more trials during 
the conditioning procedure, presumably because 
of the weak airpuff. 

Procedure—Of the remaining 64 Ss, 34 
received 50 trials with an interstimulus interval 
of .5 sec. and were then divided into two groups 
equated in terms of mean number of CR’s 
in this phase of the experiment. One of these 
groups, the .5-1.0 Shift Group, was then switched 
to a 1.0-sec. interstimulus interval for 100 further 
trials. The other group, the .5-1.5 Shift Group, 
received 100 further trials with a 1.5-sec. interval. 
Two control groups, each consisting of 15 Ss, 
were designated the 1.0 Control Group and the 
1.5 Control Group. These groups received 150 
trials at their appropriate interval. A pre-
arranged schedule insured randomness of
assignment of Ss to the shifted group and to the 
two control groups. 
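
The subject and trial counts given above are mutually consistent, as the following check shows. The variable names are illustrative. The sizes of the two shift groups are not stated separately, so only their combined total of 34 is used.

```python
screened = 87
rejected_high_blink_rate = 5
discarded_voluntary_responders = 5
dropped_missing_ucrs = 13
retained = (screened - rejected_high_blink_rate
            - discarded_voluntary_responders - dropped_missing_ucrs)

shifted = 34                     # both shift groups together
controls = {"1.0 Control": 15, "1.5 Control": 15}
assigned = shifted + sum(controls.values())

trials_shift = 50 + 100          # .5-sec. phase plus post-shift phase
trials_control = 150
print(retained, assigned, trials_shift == trials_control)   # 64 64 True
```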

Instructions were those used previously (8) 
with the addition of a statement describing the 
pretest procedure. While instructions were
recited to S, he was, at the appropriate times,
moved into the cubicle, fitted with a false eye-
lash, and adjusted into the apparatus. The S
then received four airpuffs alone spaced at the 
usual intertrial intervals. This was followed 
by a pretest series of ten presentations of the 
CS alone at a duration of .5 sec. The Ss were 
immediately discarded if on the pretest they 
gave more than four responses to the CS with a 
latency of .5 sec. or less or gave more than six 
responses with a latency of less than 1.0 sec. 
Following the pretest, S received a 1-min. rest. 
The S then received his 150 trials with a 1-min. 
rest following Trials 40, 80, and 120. No 
ready signal was used. The total time elapsing 
from beginning of instructions to completion 
of the 150 trials was approximately 50 min. 
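
The pretest rejection rule stated above can be written as a small predicate. This is a sketch of the rule as worded. The function name is hypothetical and the example latencies are invented for illustration.

```python
def discard_on_pretest(latencies_sec):
    """Return True if S should be discarded after the ten CS-alone trials.

    latencies_sec: latency of each response to the CS, in seconds
    (trials with no response are simply omitted).
    """
    fast = sum(1 for t in latencies_sec if t <= 0.5)
    under_one = sum(1 for t in latencies_sec if t < 1.0)
    # More than four responses at .5 sec. or less, or more than six under 1.0 sec.
    return fast > 4 or under_one > 6


print(discard_on_pretest([0.3, 0.4, 0.45, 0.5, 0.5]))         # True
print(discard_on_pretest([0.3, 0.4, 0.7, 0.8, 0.9, 0.95]))    # False
```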


RESULTS


Definition of the CR.—The CR’s 
in eyelid conditioning have generally 
been considered to be responses which 
anticipate the onset of the UCS but 
which are not UCR’s to the CS. If 
test trials are used, CR’s are those 
responses which anticipate the onset 
of the UCS or occur in its absence.
Since the latency characteristics of
responses to different intervals are
different and since the present experi-
mental procedure involves a shift
from an optimum interstimulus inter-
val to longer ones, the problem of the
definition of a CR for the specific 
intervals employed had to be faced 
at the outset of the analysis of results. 
In dealing with this problem the 
assumption was made that CR’s will 
tend to increase with increasing 
numbers of trials. Based upon this 
assumption, a CR was defined as a 
response in any range of response
latencies which shows a consistent 
increase in level of responding as 
conditioning proceeds. Operation- 
ally, the distribution of responses by 
latencies for an early stage of condi- 
tioning is compared with that for a 
later stage or stages. That part of 
the distribution which increases from 
one stage to the next constitutes a 
range of response latencies which is 
used to define the CR’s for the inter- 
stimulus interval involved. Such a 
procedure automatically compensates 
for the larger number of random 
blinks expected in the interval be- 
tween CS and UCS as its length 
increases by tending to eliminate them 
from consideration as CR’s. 
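
The operational definition can be sketched as a comparison of two latency histograms. The counts below are hypothetical and serve only to show the mechanics. The function name is an illustration, and the sketch compares just two stages, whereas the definition in the text asks for a consistent increase across successive stages.

```python
def increasing_latency_bins(early_counts, late_counts):
    """Return the latency bins whose response count rises from early to late.

    Both arguments map a bin's upper latency limit (sec.) to a response count.
    """
    return sorted(b for b in early_counts
                  if late_counts.get(b, 0) > early_counts[b])


# HYPOTHETICAL counts for illustration only; not data from the experiment.
early = {0.30: 6, 0.35: 5, 0.40: 7, 0.45: 8, 0.50: 9}
late = {0.30: 4, 0.35: 9, 0.40: 15, 0.45: 22, 0.50: 30}
cr_bins = increasing_latency_bins(early, late)
print(cr_bins)   # [0.35, 0.4, 0.45, 0.5]
```

Bins in which responding does not grow, such as the shortest one in this example, fall outside the CR range, which is how random blinks tend to be excluded.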

Because of the importance of the 
latency distributions, they are pre- 
sented in Fig. 1, 2, and 3 for the .5-, 
the 1.0-, and the 1.5-sec. interstimulus 
intervals, respectively. For all inter- 
stimulus intervals, the responses are 
presented as a set of five separate 
distributions representing successive 
fifths of the conditioning trials. It 
should be noted that the set of dis- 
tributions for the .5-sec. interval is 
based only upon the first 50 trials 
of the 34 Ss who received their 
original conditioning at that interval 
before being shifted. The other dis-
tributions are based upon the full 150
trials for each S in the appropriate
control group. In these figures, the
columns represent the number of
responses for latency ranges in .05-sec.
steps. The designating number be-
low each bar indicates the maximum
response latency (measured in .01
sec.) in that column. Distributions
are truncated at .25 sec. since a
negligible amount of responding
occurred before that point.

Fig. 1. Latency distributions of responses to .5-sec. interstimulus
interval.

Characteristic of the course of 
responding as shown by the sets of 
distributions is a progressive increase 
in the number of responses in the 
region immediately adjacent to the 
point of application of the UCS. 
There can also be noted a tendency 
for the responses with shorter latencies 
to show an initial increase followed by 
a decrease in level of responding for 
all three interstimulus intervals. 
This is paralleled by a shift of the 
median response latency as indicated 
by the arrows. Another expression
of this tendency is seen, at least for
the .5- and 1.0-sec. interstimulus
intervals, to be a successive narrowing
of the range of response latencies
which shows an increase in level of
responding when compared with the
first fifth of trials. This suggests
that the anticipatory blinks tend
more and more closely to approximate
the onset of the UCS. Data of a
different kind illuminate this process.

Fig. 2. Latency distributions of responses to 1.0-sec. interstimulus
interval.

The records from the operations 
recorder permitted an examination 
of the temporal relationship between 
blink and puff. Specifically, it was 
possible to determine whether or not 
the blink preceded the puff and 
whether its duration was long enough 
to overlap the puff. Typically, the
records revealed that blinks lasted
approximately .2 sec., and that a sizeable
number did not overlap the puff and
thus did not successfully avoid it.
Analysis of the records for the 1.0 Con-
trol Group showed a progressive in-
crease in the percentage of avoiding
CR’s from 70% for the first fifth to
90% for the last fifth of conditioning.
Of the 12 Ss showing a change in per-
centage of avoiding CR’s, 11 in-
creased, an event whose occurrence
by chance has a probability of less
than .01.

Fig. 3. Latency distributions of responses to 1.5-sec. interstimulus
interval.
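
The probability quoted for 11 increases among 12 Ss can be reproduced with a binomial (sign-test) calculation. The text does not name the test used, so treating it as a sign test with a chance probability of .5 is an assumption.

```python
from math import comb

n_changed = 12
n_increased = 11

# One-tailed binomial probability of 11 or more increases out of 12 at p = .5.
favorable = sum(comb(n_changed, k) for k in range(n_increased, n_changed + 1))
p_one_tailed = favorable / 2 ** n_changed
p_two_tailed = min(1.0, 2 * p_one_tailed)
print(round(p_one_tailed, 4), round(p_two_tailed, 4))   # 0.0032 0.0063
```

Both the one-tailed and the two-tailed values fall below .01, in agreement with the statement in the text.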

Thus there is evidence that, as 
conditioning proceeds, there is an 
increase in the percentage of CR’s, 
and also an increase in the precision 
of the timing of the blink so that it 
tends more and more to overlap the 
puff, presumably vitiating its effects. 
Such a finding complicates the at- 
tempt to establish a simple definition 
of the CR since the region of increased 
responding progressively narrows with 
increasing numbers of trials. Rather 
than use a sliding criterion, the follow- 
ing ranges were rather arbitrarily 
decided upon as definitions of CR’s 
appropriate to their respective inter- 
stimulus intervals: .31 to .50 sec. for 
the .5-sec. interstimulus interval, .66 
to 1.00 sec. for the 1.0-sec. interval, 
and 1.01 to 1.50 sec. for the 1.5-sec. 
interval. As reference to the latency 
distributions will show, these ranges 
fairly adequately satisfy the principle 
which designates as CR’s only those 
responses which show a consistent 
increase in level of responding with 
practice. Such a definition is for the 
1.5-sec. interval least satisfactory. 
However, the set of distributions for
that interval reflects a large number
of double blinks, the first occurring 
at approximately .4 sec. and the 
second about 1.0 sec. later, which 6 
of the 15 Ss produced. Since the 
significance of the first blink was 

Fig. 4. Distribution of response latencies of
1.5 Control Group for all trials.


rather obscure, it was decided to 
ignore it at least for the purpose of 
establishing a definition of CR com- 
parable to those for the other inter- 
stimulus intervals. It is interesting 
to note that, due to the double 
blinks, the level of responding for 
the 1.5 Control Group was highest 
in the region just preceding .5 sec. 
Figure 4 presents the total responses 
summed over all 150 trials for the 
1.5 Control Group, grouped by inter- 
vals of 1/10 sec. The marked simi- 
larity of the first half of this curve to 
idealizations of the curve relating 
amount of conditioning to interstimulus
interval, as in Kimble (7,
p. 212), is provocative.²
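
The fixed latency ranges adopted above amount to a simple classification rule, sketched here for illustration. The ranges are those given in the text; the function names and the sample latencies are hypothetical, and a trial with no anticipatory blink is represented by None.

```python
# Latency ranges (sec.) defining a CR for each interstimulus interval,
# as stated in the text.
CR_WINDOWS = {
    0.5: (0.31, 0.50),
    1.0: (0.66, 1.00),
    1.5: (1.01, 1.50),
}

def is_cr(latency_sec, isi_sec):
    """True if a blink latency falls inside the CR range for this interval."""
    low, high = CR_WINDOWS[isi_sec]
    return low <= latency_sec <= high

def percent_cr(latencies, isi_sec):
    """Percentage of trials on which a CR occurred (None = no response)."""
    hits = sum(1 for lat in latencies if lat is not None and is_cr(lat, isi_sec))
    return 100.0 * hits / len(latencies)

# Hypothetical latencies for four trials; not data from the study.
sample = [0.40, None, 0.70, 0.45]
print(percent_cr(sample, 0.5))
```

Note that under this rule a blink at about .4 sec. on a 1.5-sec. trial, such as the first of the double blinks described above, is not counted as a CR.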

Learning curves.—Conditioning func- 
tions plotting percentage of CR’s as 
defined above on successive blocks 
of trials are similar to those usually 
obtained (6, 12) with relative per- 
formance best for the .5-sec. interval 


² Hull (5, p. 106) predicted a response curve
similar to that of Fig. 4. He assumed the CS
in actuality to be the internal stimulus trace 
resulting from the application of the experi- 
mental CS. Since the intensity of the trace 
presumably rises for the first half-second, then 
declines, there should occur some responses 
early in the interstimulus interval when the 
trace has about the same intensity as at the 
point where the UCS is applied. Then should 
occur a generalization decrement as the trace 
increases in intensity before declining to its 
value at the point of application of UCS. 
and poorest for the 1.5-sec. interval.
The effects of shifting from a short to 
a long interstimulus interval appear 
in Fig. 5. Here the responses appro- 
priate to the short interval—i.e., 
lying in the latency range .31 to .50 
sec.—of the .5-1.0 Shift Group are 
compared with like responses for the 
.5-1.5 Shift Group. For the first 50 
trials during which the interstimulus 
interval is .5 sec. for both groups, the 
percentage of CR’s increased for both 
groups. After the shift to a longer 
interstimulus interval, however, these 
short interval responses gradually 
approached a level of less than 10%. 
There is a suggestion that these 
responses extinguished more rapidly 
for the group for which the shift 
was greatest, but the difference was 
not significant. By the Kolmogorov-
Smirnov test, used because of the
marked non-normality of the extinction
data, the difference reached significance
only at approximately the 20% level.
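
The statistic underlying the two-sample Kolmogorov-Smirnov test is the largest vertical distance between the two empirical cumulative distributions. A minimal sketch follows; the scores are hypothetical and are not data from the study, and the text does not say which extinction measure was entered into the test.

```python
from bisect import bisect_right

def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov D: the maximum absolute difference
    between the empirical cumulative distributions of the two samples."""
    a, b = sorted(sample_a), sorted(sample_b)
    d = 0.0
    # The maximum difference can only occur at an observed value.
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        d = max(d, abs(cdf_a - cdf_b))
    return d

# Hypothetical extinction scores for two shift groups.
group_a = [2, 4, 4, 6, 9]
group_b = [1, 2, 3, 3, 5]
print(round(ks_statistic(group_a, group_b), 3))
```

The statistic makes no assumption about the shape of the distributions, which is why it suits markedly non-normal data; the significance level for a given D depends on the two sample sizes and is taken from tables.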
After the shift from a short to a 
long interstimulus interval, responses 
appropriate to the new interstimulus 
interval exhibited an increasing per- 
centage by blocks of 10 trials. The 
acquisition curve for these responses 
following the shift of interval was 
compared with the acquisition curve 
for the appropriate control group to 
determine whether or not the previous 
conditioning at the shorter interval 
had an effect on the acquisition of 

Fig. 5. Comparison of response curves for
.5-1.0 Shift Group and .5-1.5 Shift Group.

Fig. 6. Comparison of responses with latency
.66 to 1.00 sec. for .5-1.0 Shift Group and
1.0 Control Group.


the long interval responses. Figure 
6 makes this comparison between the 
10 blocks of 10 trials following the 
shift for the .5-1.0 Shift Group and 
the acquisition curve for all responses 
for the 1.0 Control Group. There 
can be no question of the lack of a 
significant difference between the two 
curves. A similar but not quite so 
striking result (not presented here) 
holds for the groups conditioning 
with an interstimulus interval of 1.5 
sec. Apparently, the performance 
during conditioning for the longer 
interstimulus intervals increases at 
the same rate whether or not Ss had 
been conditioned previously with a 
shorter interstimulus interval. 

This same finding by Boneau, King, 
and Kimble (2) had led to a considera- 
tion of the possibility that the distri- 
bution of response latencies simply 
changed in temporal position by 
sliding through the region intermedi- 
ate between the two which define 
CR’s. If this were so, the inter- 
mediate region should exhibit a greater 
level of responding for the shift group 
following the shift than is exhibited by 
the control group. Such was not the 
case; nor is it in the present experi- 
ment. Figure 7 presents the level of 
responding for all blocks of 10 trials 
of the 1.0 Control Group in the inter- 

Fig. 7. Comparison of responses with latency
.51 to .65 sec. for .5-1.0 Shift Group and 1.0
Control Group.


mediate region of latencies, .51 to .65 
sec., as well as the corresponding 
responses for the 10 blocks of trials 
of the .5-1.0 Shift Group which 
occurred after the shift. The simi- 
larity between the two curves is 
marked even to the slight rise in 
response level for both groups during 
the fourth block of ten trials after 
the beginning of conditioning at this 
interstimulus interval. For both 
groups, the peak can be attributed to 
the previously observed increased 
level of responding at the shorter 
latencies early in conditioning. As 


before, the results for the 1.0 groups 
were paralleled by the results for the 
1.5 groups, which are not presented 
here. 


Discussion 


The results of shifting from an optimal 
interstimulus interval to a longer, non- 
optimal one may be summarized briefly. 
First, as in McAllister’s study (10), 
the responses appropriate to the shorter 
interval extinguish gradually. Secondly, 
the responses appropriate to the longer 
interstimulus interval, simultaneously 
with the above, describe an acquisition 
curve indistinguishable from that of a 
control group conditioning only at the 
longer interval. Apparently condition- 
ing at the shorter interval has no effect 
on the acquisition of the response 
appropriate to the longer interval. 
Thirdly, in the interval of latencies 
between those intervals which define 
CR’s, the level of responding seems to be 
the same as that for the appropriate 
control group after a similar amount 
of conditioning at the longer interval.
This latter fact is evidence that the 
distribution of response latencies does 
not shift in temporal locus in a con- 
tinuous fashion from the region appropri- 
ate to the first interstimulus interval 
to that appropriate to the second inter- 
val. The responses appropriate to the 
two interstimulus intervals thus seem 
to demonstrate independence, one ex- 
tinguishing at the same time the other 
shows acquisition, with no evidence of 
interaction between the two processes. 
These results constitute in detail a 
replication of the results of the Boneau, 
King, and Kimble study. 

From the above results, the prediction 
may be made that the result of a shift 
from one interstimulus interval to a 
longer one will be, if all anticipatory 
responses are considered, simply a grad- 
ual change to a level of responding which 
would have been obtained had the 
conditioning been only at the longer 
interstimulus interval. This is so since 
the short interval responses will ex- 
tinguish to a level of random responding, 
the long interval responses will approach 
their appropriate asymptote, and the 
level of all other responses will be the 
same under both conditions. 

The additional finding of the present 
experiment that the region of increased 
responding, which is used to define CR’s, 
narrows with increased training raises 
an important issue. Such an increase 
in latency as a result of training is in 
disagreement with certain available evi- 
dence. In the well-known studies of 
Pavlov (11, pp. 91-102), the salivary 
response for intervals of a few minutes 
or less typically decreases in latency so 
as to follow the CS immediately. In 
another study to which frequent refer- 
ence is made, Campbell and Hilgard 
(3) report a fairly strong inverse rela- 
tionship (r = —.54) between latency 
and frequency of CR in the eyelid condi- 
tioning situation, seemingly the opposite 
of the present results. The latter dis- 
crepancy may be accounted for by a 
consideration of the conditions under 
which the Campbell and Hilgard experiment
was conducted. The photographic
method of recording utilized by them 
necessitated conditions conducive to 
dark adaptation of S. As Grant and 
Norris (4) have shown, dark-adapted Ss 
tend to respond to light with a blink 
(“beta” response) having a latency of 
.12 to .24 sec. The fact that the bulk 
of responses reported by Campbell and 
Hilgard fell in this range would suggest 
that the decrease in latency of responding 
reported by them was due to increasing 
numbers of beta responses rather than 
a decrease in the latency of the true CR. 

In considering the Pavlov results, it 
must be remembered that the salivary 
response of the dog is very dissimilar 
to the human eyelid response, and the 
possibility exists that the two responses 
may function quite differently. A strik- 
ing feature of the eyelid response, for 
example, is the adaptiveness of the 
change of precision of the blink so that 
it tends more and more to avoid the puff. 
Once the CR appears, it behaves as an 
instrumental response in that its char- 
acteristics can be altered by the rein- 
forcing contingencies in the situation 
since analysis of the nonavoiding re- 
sponses has demonstrated that the 
change in latency is indeed related to the 
reinforcement properties. This assumes 
that avoidance of the puff is a reinforcing 
state of affairs, and that responses with 
latencies such that the puff is avoided 
tend to occur predominantly as a result 
of differential reinforcement. In much 
the same way, bar presses of a specified 
pressure tend to occur when reinforce- 
ment is made contingent upon a response 
of that specific pressure (1). The micro- 
molar approach of Logan (9) would seem 
to find a meaningful application here. 
If blinks of different latency are considered
to be different responses, blinks having
latencies such that the puff is avoided would,
through reinforcement, increase in response
strength, while nonavoiding blinks would tend
to extinguish. The consideration of responses
of different latencies as being different 
responses is rendered more tenable by 
the observation of the seemingly independent
behavior of responses appropriate
to the long and short interstimulus
intervals following the shift. Spence 
(13) also has reported results showing 
systematic changes in latencies of condi- 
tioned eyelid responses. Specifically, 
for high-level conditioning Ss the latency 
decreased, while for low-level condi- 
tioners an increase in latency resulted 
as conditioning progressed. While no 
explanation for these divergent results 
was presented, Spence suggested that 
the latency of the response may be 
what is learned. The present results 
support this contention. 

If the preceding analysis is correct, 
the latency characteristics of the classic- 
ally conditioned eyelid response may be 
modified by reinforcing contingencies 
inherent in the situation so that the 
response tends to maximize the rein- 
forcing potential present. The Pav- 
lovian results also are amenable to such 
a treatment. It is only necessary to 
make the reasonable assumption that 
saliva in the mouth of the dog at the 
same time food powder or acid is applied 
to the tongue is a reinforcing state of 
affairs. If this be so, then, up to a point, 
more saliva, the result of a CR with a 
shorter latency, should be even more
reinforcing. The observed decrease in 
latency would easily follow from the prin- 
ciple of maximization of reinforcement. 

A further interesting difference be- 
tween eyelid and salivary conditioning 
can be predicted from such an analysis. 
We assume that the simultaneity of CR 
and UCS is essential for strengthening 
response tendencies beyond an initial 
level. In temporal conditioning the 
simultaneous occurrence of two extremely
short events such as blink and
puff should approach zero probability 
with increasing intertrial intervals. Pre- 
cision of timing seems much less im- 
portant in the case of salivation and 
food since salivation is a cumulative 
process. As a result, we would expect 
that strengthening of the temporal 
conditioned response would be relatively 
easy for salivation and virtually im- 
possible for the eyeblink. 

SUMMARY


Three groups were given eyelid conditioning 
trials with interstimulus intervals of .5, 1.0, or 
1.5 sec. The group conditioning at the .5-sec. 
interval was subdivided into two groups, each 
of which was further conditioned at one of the 
other intervals. Ranges of latencies defining 
CR’s for each of the interstimulus intervals 
were established by considering the ranges of 
response latencies which showed a consistent 
increase in level of responding with practice. 
It was discovered that the latency of the response 
tended to increase with conditioning and that 
the percentage of blinks which succeeded in 
avoiding the puff also increased. This fact was 
discussed with reference to the usual assumption 
and evidence that latency decreases in the course 
of conditioning, and a reconciliation of the
empirical conflict was attempted. Using the
definitions of CR’s from the preceding analysis, 
learning curves were constructed which showed 
that, after the shift from short to long interstimulus
interval, the responses appropriate
to the short interval showed a gradually dimin- 
ishing level while, at the same time, the level 
of responding of responses appropriate to the 
long interval increased at the same rate as 
for the groups having no conditioning expe- 
rience. 

EFFECTS ON VERBAL LEARNING OF ANXIETY,
REASSURANCE, AND MEANINGFULNESS
OF MATERIAL¹


IRWIN G. SARASON 


University of Washington 


Several studies in recent years have 
contributed to our knowledge of the 
effects of anxiety as measured by 
questionnaires on the performance of 
Ss in stressful and nonstressful (neu- 
tral) situations (e.g., 2, 4, 10). The 
following are a few of the generaliza- 
tions one might make on the basis 
of some of this experimental evidence. 

Under conditions of personal threat 
induced by experimental instructions 
(e.g., motivational instructions or 
failure reports) high-anxious Ss tend 
to perform at a lower level than do 
low-anxious Ss on tasks such as those 
often used in verbal learning experi- 
ments (4, 5, 10, 11, 12). Under 
neutral, nonthreat conditions (to the 
extent that such conditions can be 
experimentally created) high-anxious 
Ss appear to perform on at least as 
high a level as do low-anxious Ss on 
similar tasks (4,5, 10,11,12). There 
is a suggestion that different anxiety 
scales, e.g., test and general anxiety 
questionnaires, may lead to different 
kinds of relationships in certain kinds 
of situations (8, 13). It may well 
be that the more specific the anxiety 
scale is to situations which are similar 
to the experimental situation, the 
more will the measure of anxiety 
be related to performance. 

The evidence for these generaliza- 
tions also provides a basis for an 


¹ This research was supported in part by
grants from the Agnes H. Anderson Research
Fund and the Graduate School Research Fund
of the University of Washington. The writer
is indebted to Robert E. Farley for his assistance
in carrying out this experiment.


initial attempt to conceptualize the 
variable or variables being tapped by 
anxiety scales. High-anxious Ss may 
differ from other Ss in the extent 
to which their performance is dis- 
rupted by task-irrelevant responses 
when they perform under stress 
(2, 5, 16). It may be conjectured 
that there are at least two possible 
classes of interfering responses: (a) 
self-verbalization, e.g., “This test is 
too hard for me,” “I am stupid,” 
and (b) autonomic anxiety responses, 
e.g., accelerated heart rate, sweating, 
etc. Since these hypothesized inter- 
fering responses are assumed to be 
elicited to a much greater degree in 
high-anxious Ss than in low-anxious 
Ss under conditions of personal threat, 
and if they are indeed “interfering” 
responses, the relatively greater dis- 
ruption in the performance of high- 
anxious Ss after personal threat would 
be expected. Under nonthreatening 
conditions the arousal of interfering 
responses would not necessarily be 
expected either in high- or low-anxious 
Ss. 

Present knowledge of the ante- 
cedent conditions of such interfering 
responses is very meager. If, how- 
ever, it is assumed that the interfering 
responses aroused under certain moti- 
vation conditions are learned re- 
sponses, it seems fruitful to explore 
the conditions which would lead to 
the extinction of those interfering 
responses. This was the principal 
purpose of the present experiment. 

If high- and low-anxious Ss are 
placed in a learning situation and 
are given only the instructions necessary
to communicate how they are to
carry out the task, important differences
between high- and low-anxious
Ss would not necessarily be
expected. However, the high-anx- 
ious Ss may, even so, interpret the 
experimental situation as personally 
evaluative. Therefore, it can be 
inferred that even under neutral 
conditions high-anxious Ss emit re- 
sponses which could compete with 
task-appropriate responses. Available 
data indicate that under neutral 
conditions high-anxious Ss appear to 
do as well as low-anxious Ss on many 
intellectual tasks (5, 10, 11, 12).
Therefore if such interfering responses 
were reduced in number or strength 
prior to performance, high-anxious Ss 
should perform at a higher level than 
would low-anxious Ss. The present 
experiment attempts to reduce the 
effects of interfering responses by the 
use of “reassurance” instructions.
Another important variable in 
evaluating the effects of anxiety and 
experimental instructions on performance
is the nature of the task (3,
4, 6). A very difficult task, for 
example, may lead to a greater decre- 
ment in performance for high-anx- 
ious Ss than for low-anxious Ss, not 
because of the task’s difficulty per se 
but because a more difficult task may 
arouse much more anxiety (i.e., more 
interfering responses) than would a 
simple task for high-anxious Ss (9). 
Also, some evidence suggests a pos- 
sible interaction between anxiety and 
difficulty of task as defined, for 
example, by the strength of com- 
peting responses in a list of nonsense 
syllables (3, 4, 6). As a consequence
of these considerations, two levels of 
difficulty of task were employed in the 
present experiment. Thus, there were 
three variables in our experiment: 
(a) anxiety, high and low scorers; (b)
instructions, reassurance and neutral;
and (c) task difficulty, two lists of 
words differing in their meaning- 
fulness. 

Two instruments were used to 
assess anxiety. One instrument was a 
Test Anxiety Scale (TAS) developed 
by the writer (14); the other instru- 
ment was Bendig’s short form of the 
Taylor Manifest Anxiety Scale (MAS) 
(1). These two scales were used to 
assess the importance of the speci- 
ficity of anxiety scales mentioned 
earlier. If the anxiety scale most 
closely related to the kinds of anxie- 
ties which might be aroused in a 
given situation leads to the most 
meaningful results, then the present 
results would be expected to more 
closely adhere to the stated expecta- 
tions when Ss were categorized in 
terms of TAS than when categorized 
in terms of the MAS which contains 
no items specifically concerned with 
reactions to test situations. For 67 
college women and 309 college men, 
Pearson product-moment correlations 
between TAS and MAS of .49 and .41, 
respectively, have been reported (15). 


METHOD


Subjects.—The Ss were 76 students taking an 
introductory psychology course at the Univer- 
sity of Washington. For the TAS scores the 
mean, median, and SD of the obtained distribu- 
tion were 5.45, 5.61, and 3.23, respectively. 
The range was 0-16. The Ss with TAS scores 
of 6 and over were assigned to high-TAS groups, 
and Ss with scores below 6 were assigned to low- 
TAS groups. Since there were more Ss in the 
groups receiving standard instructions than 
those receiving reassurance instructions, it 
became necessary to randomly discard 12 Ss 
from the former groups in order to have equal 
Ns of 8 Ss per treatment group. This was done 
in order to carry out the analyses of variance 
involving repeated measurements to be described 
below. 

For the same reason, 12 Ss were randomly 
discarded for the MAS analysis of variance. 
The mean, median, and SD of the MAS score 
distribution were 4.75, 4.47, and 3.65, respectively.
The range was 0-16. The Ss assigned
to high-MAS groups had scores of 5 and greater,
while low-MAS Ss had scores of 4 and below.

TABLE 1

LISTS OF DISSYLLABLES DIFFERING IN MEANINGFULNESS AS
DEFINED BY NOBLE’S m VALUES

      Low Meaningfulness              High Meaningfulness

  .99  GOJEY     1.28  NARES      8.98  MONEY     6.24  INCOME
 1.04  NEGLAN    1.28  ZUMAP      8.33  DINNER    6.02  HUNGER
 1.79  BRUGEN    1.34  NOSTAW     7.95  OFFICE    5.98  REGION
 1.24  XYLEM     1.30  POLEF      5.47  KEEPER    5.61  MALLET
 1.24  TAROP     1.39  BODKIN     7.70  JELLY     5.94  LEADER
 1.74  DAVIT     1.26  LATUK      7.39  INSECT    5.52  KENNEL
 1.22  BALAP     1.13  BYSSUS     7.12  ZEBRA     8.12  WAGON
 1.73  MATRIX    1.55  KUPOD      7.58  JEWEL     5.13  EFFORT
 1.27  GOKEM     Mean = 1.34      6.57  UNCLE     Mean = 6.80

Test materials.—The Test Anxiety Scale
(TAS) consists of 17 True-False items dealing
with S’s reactions before, during, and after
taking examinations (14). The present TAS
represents a transformation of the graphic
rating scale Test Anxiety Questionnaire (5)
into true-false terms.

The Taylor MAS measure used was Bendig’s
20-item short form (1). Bendig has presented
evidence which indicates that his 20-item MAS
scale possesses slightly less reliability than does
the 50-item MAS but probably contains the
20 most valid items of the 50 Taylor MAS items.

Learning materials.—In order to manipulate
the variable of task difficulty two lists of verbal
stimuli differing in meaningfulness as defined by
Noble (7) were used. Table 1 presents the two
17-item lists employed. The mean m value for
the high-meaningful list was 6.80, and for the
low-meaningful list, 1.34.

Procedure.—Several weeks prior to participating
in the verbal learning, all Ss had taken the
two anxiety scales. The Ss were given no basis
for assuming a relationship between the anxiety
scales and the verbal learning.

Each list of 17 dissyllable words was exposed
on a memory drum with a 2-sec. exposure per
word and 6 sec. between trials. Each S received
25 trials on the appropriate list.² Learning
was by the anticipation method, asterisks
served as cues for anticipating the first word,
and Ss were never informed about the criterion
being used to determine the termination of the
learning task. The E was unaware of Ss’
anxiety scores at the time of running the verbal
learning study.

² In those cases where S had five consecutive
perfect trials E ended the experimental session
at the end of the fifth consecutive perfect trial
and S was assigned perfect scores for the
remainder of the 25 trials.

The following instructions were given to all 
Ss upon entering the experimental situation: 
“You will see words appearing in this opening 
one at a time. After a word is presented, call 
out the next one before it appears. Of course, 
the first time through the list you won’t be able 
to anticipate any words, but after that call out 
the word before it appears in the window. 
Prior to the beginning of each new trial there 
will be a short rest in which you will see blank 
spaces in the opening. Asterisks will indicate 
that the first word in the list will appear next. 
When you see them call out the first word. 
Do you understand?” 

The Standard instructions groups received 
only those instructions; the Reassurance in- 
structions groups received in addition the follow- 
ing statement from E prior to being administered 
the standard instructions: “Before we start, 
perhaps I could mention a few things that will 
be helpful to you in the learning of the list I am 
going to show you on this memory drum. 
Many people get unduly upset and tense because 
they do not learn the list in just a few trials. 
If you don’t worry about how you are doing but 
rather just concentrate on the list, you will find 
you learn much more easily. These kinds of 
lists are hard and so it’s no surprise or matter of 
concern if you progress slowly at first and make 
mistakes.” 

A person other than E was responsible for 
assigning Ss to the experimental treatment. 


RESULTS 


The data for the 25 trials were 
grouped into 5 blocks of 5 trials each. 
The score used in the analyses of 
variance was the number of correct 
responses per block. 
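
The scoring just described, together with the rule in footnote 2 for Ss who reached five consecutive perfect trials, can be sketched as follows. The sketch assumes that a perfect trial is 17 correct anticipations (one per word in the list); the function names and the sample record are hypothetical.

```python
LIST_LENGTH = 17   # assumed maximum correct anticipations per trial
N_TRIALS = 25
BLOCK_SIZE = 5

def pad_to_full_session(trial_scores, n_trials=N_TRIALS, perfect=LIST_LENGTH):
    """Assign perfect scores to trials not run because S had already
    given five consecutive perfect trials (footnote 2)."""
    return list(trial_scores) + [perfect] * (n_trials - len(trial_scores))

def block_scores(trial_scores, block_size=BLOCK_SIZE):
    """Number of correct responses in each successive block of trials."""
    return [
        sum(trial_scores[i:i + block_size])
        for i in range(0, len(trial_scores), block_size)
    ]

# Hypothetical S whose session ended after 12 trials, the last five perfect.
record = [0, 3, 6, 9, 12, 14, 16, 17, 17, 17, 17, 17]
print(block_scores(pad_to_full_session(record)))
```

On this assumption the largest possible block score is 85, which is consistent with the range of the block means reported in Table 2.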

TABLE 2

MEAN NUMBER OF CORRECT ANTICIPATIONS FOR 5 BLOCKS OF 5 TRIALS EACH
FOR Ss CATEGORIZED ON BASIS OF TEST ANXIETY SCALE (TAS)

                    High TAS                            Low TAS
          Reassurance       Standard        Reassurance       Standard
Trial     High     Low      High     Low    High     Low      High     Low
Block     Meaning  Meaning  Meaning  Meaning Meaning Meaning  Meaning  Meaning

  1       17.38     9.88    20.88     5.75  16.75     3.88    23.63     5.00
  2       40.50    25.13    44.63    13.00  34.50    14.88    52.88    19.13
  3       60.50    36.38    66.63    23.38  56.38    27.50    72.00    36.38
  4       75.25    49.25    78.13    33.38  64.00    36.00    81.63    51.25
  5       82.13    54.63    79.38    38.25  71.38    43.88    80.75    63.88

Test Anxiety Scale.—Table 2 presents
the mean number of correct
anticipations for the eight experimental
groups for the five blocks of
trials. Table 3 presents the results
of the 2 X 2 X 2 X 5 repeated measurement
analysis of variance performed
on the TAS data. As might
be expected, the F for meaningfulness
was highly significant (P < .001)
with a greater number of correct
anticipations given to the high- than
to the low-meaningful lists. As expected,
a significant F (P < .01)
was obtained for the TAS X Instructions
interaction. As Table 2 indicates,
high-TAS Ss performed at a
higher level under the Reassurance
than under the Standard instructional
condition. Low-TAS Ss, on
the other hand, performed at a higher
level under the Standard than under
the Reassurance conditions. Thus,
the differential instructions had opposite
effects for high- and low-TAS
Ss.

TABLE 3

SUMMARY OF TAS ANALYSIS OF VARIANCE OF
CORRECT ANTICIPATIONS FOR 5
BLOCKS OF 5 TRIALS EACH

Source                  df   MS            F

Test Anxiety (TAS)       1         .31     —
Meaningfulness (M)       1   55,862.45     77.25***
Instructions (I)         1      973.01      1.35
TAS X M                  1      117.61     —
TAS X I                  1    5,445.00      7.53**
M X I                    1    1,757.81      2.43
TAS X M X I              1      649.75     —
Error (b)               56      723.01
Trial Blocks (T)         4   28,422.19    460.88***
TAS X T                  4   [illegible]   —
M X T                    4      993.87     16.12***
I X T                    4       26.50     —
TAS X M X T              4      162.99      2.64*
TAS X I X T              4        2.11     —
I X M X T                4   [illegible]   [illegible]
TAS X M X I X T          4      366.83      5.95***
Error (w)              224       61.67

  * P = .05.
 ** P = .01.
*** P = .001.
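
The direction of the TAS X Instructions interaction can be read from Table 2 by summing each group's block means over the five blocks and averaging over the two lists. The sketch below does this with the means transcribed from Table 2; the function and key names are illustrative only.

```python
# Mean correct anticipations per block, transcribed from Table 2.
# Keys: (TAS level, instructions, list meaningfulness).
TABLE_2 = {
    ("high", "reassurance", "high_m"): [17.38, 40.50, 60.50, 75.25, 82.13],
    ("high", "reassurance", "low_m"):  [9.88, 25.13, 36.38, 49.25, 54.63],
    ("high", "standard", "high_m"):    [20.88, 44.63, 66.63, 78.13, 79.38],
    ("high", "standard", "low_m"):     [5.75, 13.00, 23.38, 33.38, 38.25],
    ("low", "reassurance", "high_m"):  [16.75, 34.50, 56.38, 64.00, 71.38],
    ("low", "reassurance", "low_m"):   [3.88, 14.88, 27.50, 36.00, 43.88],
    ("low", "standard", "high_m"):     [23.63, 52.88, 72.00, 81.63, 80.75],
    ("low", "standard", "low_m"):      [5.00, 19.13, 36.38, 51.25, 63.88],
}

def mean_total(tas, instructions):
    """Total correct over the 5 blocks, averaged over the two lists,
    for one TAS X Instructions cell."""
    totals = [
        sum(blocks)
        for (t, i, _), blocks in TABLE_2.items()
        if t == tas and i == instructions
    ]
    return sum(totals) / len(totals)

for tas in ("high", "low"):
    for instructions in ("reassurance", "standard"):
        print(tas, instructions, round(mean_total(tas, instructions), 2))
```

The four cell values cross over: reassurance exceeds standard for high-TAS Ss, and standard exceeds reassurance for low-TAS Ss, which is the pattern described in the text.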

The Trials X Anxiety X Meaning- 
fulness interaction (P < .05) was 
largely attributable to differences be- 
tween high- and low-TAS groups on 
the high- and low-meaningful material 
for the fifth block of trials. On that 
block for high-meaningful groups, 
high-TAS Ss were superior to low- 
TAS Ss. For low-meaningful ma- 
terial on the same block, low-TAS Ss 
were superior to high-TAS Ss. 

The significant Trials X Meaning- 
fulness interaction (P < .001) indi- 
cates the learning curve for the high- 
meaningful material to be more steep 
than the curve for low-meaningful 
material early in learning and less 
steep late in learning. 
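
The F ratios of Table 3 can be checked from the mean squares. The sketch assumes the conventional assignment for a repeated measurement design, in which between-Ss effects are tested against Error (b) and effects involving trial blocks against Error (w); the table itself does not state this. Only legible mean squares are used.

```python
MS_ERROR_BETWEEN = 723.01   # Error (b), 56 df
MS_ERROR_WITHIN = 61.67     # Error (w), 224 df

# Mean squares as read from Table 3.
BETWEEN_EFFECTS = {
    "Meaningfulness (M)": 55862.45,
    "Instructions (I)": 973.01,
    "TAS X I": 5445.00,
    "M X I": 1757.81,
}
WITHIN_EFFECTS = {
    "Trial Blocks (T)": 28422.19,
    "M X T": 993.87,
    "TAS X M X T": 162.99,
    "TAS X M X I X T": 366.83,
}

def f_ratio(ms_effect, ms_error):
    """F as the ratio of an effect mean square to its error mean square."""
    return ms_effect / ms_error

for name, ms in BETWEEN_EFFECTS.items():
    print(f"{name}: F = {f_ratio(ms, MS_ERROR_BETWEEN):.2f}")
for name, ms in WITHIN_EFFECTS.items():
    print(f"{name}: F = {f_ratio(ms, MS_ERROR_WITHIN):.2f}")
```

The ratios computed this way agree with the tabled F values to within rounding, including the value for the TAS X Instructions interaction.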
Manifest Anxiety Scale.—The results
of the 2 X 2 X 2 X 5 analysis
of variance performed using MAS as 
the measure of anxiety revealed only 
two significant F’s—those for Mean- 
ingfulness (P < .001) and for Trials 
(P < .001). Again performance was 
on a higher level under the high- 
meaningful list than under the low- 
meaningful list. 


Discussion 


In general, the results obtained in 
this experiment tended to support the 
stated expectations. There was no tend- 
ency toward an interaction between 
anxiety and instructions using MAS as 
the measure, but this interaction was 
significant for TAS.³

This finding would appear compatible 
with the assumption that high-anxious 
Ss bring with them into situations certain 
response tendencies which may interfere 
with performance. The reassurance con- 
dition seemed to reduce the effect of 
these interfering responses. 

How might one account for the detri- 
mental effect of reassurance for low-TAS 
Ss for both levels of meaningfulness? 
One fairly obvious possibility is that the 
reassurance for low-TAS Ss functioned 
to reduce their ego involvement in the 
task on which they were performing. If 
S is told that errors will inevitably occur, 
one conclusion which some Ss might 
have drawn is that efforts at reducing 
errors will be of little value. Thus, for 
low-TAS Ss, reassurance may have been 
a cue to “take it easy.” 

The present findings suggest that 
the more specific the psychometric
measure of anxiety is to the class of
situations in which we may be interested 


³ It would be of considerable interest to
replicate the present study using very extreme 
high- and low-anxiety groups. On the basis of 
the present results and our interpretation of 
anxiety, we would expect that using high- and 
low-TAS groups more extreme than those used 
in this experiment would increase this inter- 
action effect. 
the more useful apparently will the
anxiety measure be. 

An interaction between anxiety and 
task difficulty of the type reported by 
Montague (6) was not obtained in this 
study. However, the Trials X Anxiety X
Meaningfulness interaction and the four-factor
interaction obtained for the TAS
analysis suggest that difficulty of task 
and anxiety may be complexly related 
to each other. 


SUMMARY 


Effects on performance in verbal learning were 
measured for (a) anxiety as defined by Test 
Anxiety Scale scores and Taylor MAS scores 
(Bendig revision), (b) meaningfulness of ma-
terial (Noble’s m values), and (c) experimental
instructions. The experimental instructions 
were: (a) standard instructions (S merely in- 
formed of the required manner of response to the 
memory drum), and (b) reassurance instructions 
(S informed that many errors were to be ex- 
pected on the type of task on which he was 
performing and that he should not worry about 
making many errors). In a repeated measure- 
ment analysis of variance design the effects of 
the three experimental variables and their
interactions with trials were assessed. 

The results showed that for Test Anxiety, 
there was a significant interaction between Test 
Anxiety (high and low) and the kind of instruc- 
tions (reassurance and standard) administered 
to S. It was found that the reassurance instruc-
tions were relatively facilitative for high Test
Anxiety Ss and relatively detrimental for low 
Test Anxiety Ss. Under the standard instruc- 
tional condition the performance of low Test 
Anxiety Ss appeared to be facilitated whereas 
just the opposite seemed to be true for high Test 
Anxiety Ss. When the data were analyzed in 
terms of Taylor MAS, there were no effects 
involving anxiety which attained significance. 
The results were discussed in terms of an inter- 
fering response interpretation of anxiety. 
THE RELATIONSHIP OF STIMULUS SIMILARITY
AND NUMBER OF RESPONSES¹


JACK RICHARDSON 
Harpur College 


The experimental tasks reported 
as examples of concept formation 
vary in many respects but seem 
to have two common characteristics 
which also serve to distinguish them 
from rote-learning tasks. The same 
response is correct for more than one 
stimulus and the stimuli which are 
designated by identical responses 
are discriminatively different but are 
“similar” in some way (have com- 
mon elements, belong to the same 
class, etc.) while the stimuli desig- 
nated by different responses are less 
similar. 

Although verbalization of the basis 
of classification is often used as the 
criterion of concept attainment, there 
are two major methods of presenting 
concept examples and each has an- 
other criterion of concept learning or 
attainment. The first method pre- 
sents single examples of each concept 
until the correct responses are learned 
and then presents a different example 
of each concept and requires the same 
response (e.g., 1, 7, 9). The cri- 
terion of concept learning is the 
tendency for new examples of a 
concept to evoke the same response 
as the previous examples. This type 
of concept study resembles rote- 
learning studies of transfer in paired 
associates where the stimuli are simi- 
lar and the responses identical (e.g., 


1 This research was supported by Grant
G2594 from the National Science Foundation to 
the Research Foundation of State University of 
New York. Completion of this study was made 
possible by the cooperation of Paul Gimmie, 
Charles Traphagen, Malcolm Piester, and the 
students of Vestal Central School who served 
as Ss. 
4, 15). Positive transfer from A-B
to A’-B is expected in both, but A 
and A’ are examples of the same 
concept in the concept studies while 
in the transfer studies A and A’ have 
been rated as similar. 

The second method presents several 
examples of a concept without re- 
quiring the correct response for the 
previous examples (e.g., 3, 10, 11). 
The criterion for concept learning 
is the tendency for each example of 
the concept to evoke the same re- 
sponse. In these studies it would be 
possible to learn the correct response 
to each concept example independ- 
ently, but it is assumed that assigning 
the same response to each example 
of a concept facilitates learning. 
Apparently there is positive transfer 
between examples of the same con- 
cept. To apply this method to 
rote-learning materials would require
lists in which similar stimuli are
paired with the same response. 
There is some indication that
similarity ratings are useful in pre-
dicting the relative difficulty of con-
cepts (13). If stimulus similarity
and number of responses are relevant 
to the differences between rote learn- 
ing and concept learning then it 
should be possible to produce results 
characteristic of concept learning by 
manipulating rote-learning materials. 
The results should conform to the 
laws of transfer in verbal learning 
and at the same time show the 
faster learning and apparently better 
retention of concepts (10, 11, 12). 

The present study is concerned with 
paired-associate lists in which the
number of stimuli were held constant
while the similarity of the stimuli 
and the number of different responses 
were varied. Of course, the im- 
portant thing is the relationship be- 
tween the stimulus similarity and 
the number of responses rather than 
either isolated from the other. One 
of the lists resembles the second 
method of concept presentation in 
that highly similar stimuli are desig- 
nated by the same response while 
there are different responses for low- 
similarity stimuli. 


METHOD


Lists.—Nine lists of 16 pairs were constructed 
with adjectives from Haagen (6) as stimuli and 
nonsense syllables from Glaze (5) as responses. 
The only differences in the lists were the number 
of different syllables used as responses and the 
similarity of the stimuli which were paired with 
a response. The design may be considered 
factorial with three different numbers of re- 
sponses and three stimulus similarity relation- 
ships. Low-similarity adjectives were used as 
stimuli for three of the lists and consisted of 16 
adjectives with no apparent or rated similarity. 
The high-similarity adjectives consisted of four 
groups of four highly similar items and were 
used as stimuli in six of the lists. The nonsense 
syllables were of either 47% or 53% association
value, of minimum formal similarity, and either 
2, 4, or 8 different syllables were correct as 
responses for all 16 stimuli in a list. 

The nine lists are presented in Table 1 with 
the high-similarity stimuli in Column 1, the 
low-similarity stimuli in Column 2, and the 
responses which were paired with the stimuli in 
Columns 3-8. The headings of the columns 
containing the responses designate the different 
lists with the number indicating the number of 
different responses in the list and the letters 
indicating that the responses in that column 
were paired with the low (L) or the high- 
similarity (Ha and Hr) stimuli. For each dif- 
ferent number of responses there were two 
pairings with the high-similarity stimuli. In 
one there was high intraresponse stimulus 
similarity (Ha), i.e., the responses were assigned 
so there was maximum similarity between 
stimuli paired with the same response. In 
the other there was high interresponse stimulus 
similarity (Hr), i.e., there was maximum simi- 
larity between stimuli paired with different 
responses. 

It should be noted that only in the lists 
with four responses is it possible to assign a 
different response to each group of similar 
adjectives or to assign a single adjective from 
each similarity group to one response. 

TABLE 1
Lists

      Stimuli                  Responses
Item  High       Low        2-Ha  2-Hr  4-Ha  4-Hr  8-Ha  8-Hr
  1   DOUBLE     VERBAL     BIH   BIH   BIH   BIH   BIH   BIH
  2   DUAL       TWOFOLD    BIH   ZET   BIH   VOM   BIH   SUJ
  3   PAIRED     HALLOWED   BIH   BIH   BIH   RUK   SUJ   VOM
  4   TWOFOLD    DARING     BIH   ZET   BIH   ZET   SUJ   NAC
  5   SACRED     PERFECT    BIH   BIH   VOM   BIH   VOM   BIH
  6   DIVINE     DIRTY      BIH   ZET   VOM   VOM   VOM   SUJ
  7   HALLOWED   REMOVED    BIH   BIH   VOM   RUK   NAC   VOM
  8   HOLY       RISING     BIH   ZET   VOM   ZET   NAC   NAC
  9   SPOKEN     ROBUST     ZET   BIH   RUK   BIH   RUK   RUK
 10   ORAL       FLUID      ZET   ZET   RUK   VOM   RUK   YAD
 11   VERBAL     PLAYFUL    ZET   BIH   RUK   RUK   YAD   ZET
 12   VOCAL      GLEAMING   ZET   ZET   RUK   ZET   YAD   QIX
 13   TRANQUIL   URGENT     ZET   BIH   ZET   BIH   ZET   RUK
 14   PEACEFUL   OPEN       ZET   ZET   ZET   VOM   ZET   YAD
 15   QUIET      STEADY     ZET   BIH   ZET   RUK   QIX   ZET
 16   SERENE     LITTLE     ZET   ZET   ZET   ZET   QIX   QIX


Procedure.—The 16 pairs in each list were
arranged in three random orders and presented
with a 16-mm. strip-film projector. The pairs
were flashed from the back of a screen so that
S could not see either E or the projector during
the presentation. The stimulus of each pair
was presented for 2 sec. followed by a 2-sec.
presentation of the stimulus and response
together. There was a 4-sec. intertrial interval.

Twenty-four different Ss learned each list; 
12 Ss were college students from introductory 
psychology courses and 12 were junior or senior 
high school students. The Ss from each of the 
two groups of students were assigned to lists 
as they appeared at the laboratory. None of 
the Ss had previously served in a learning experi- 
ment. They were required to spell the syllables 
and were given the usual instructions for paired- 
associate learning by the anticipation method. 
They were not told that the same response would 
be correct for more than one stimulus or that 
the stimuli were grouped in any way for the 
assignment of responses. 

To avoid selection due to loss of Ss and to 
approximately equate performance at the end 
of the first session, the lists were presented for a 
specific number of trials. The lists with two 
different responses were presented for 5 trials, 
those with four different responses for 10 trials, 
and the lists with 8 responses for 15 trials. 
After learning for the appropriate number of 
trials, Ss were dismissed and returned 48 hr. 
later for the second session. Since differential 
transfer was expected between items in the lists, 
it was important that the responses should not 
be presented during any test of recall. To 
control for intralist transfer and for warm-up, 
the first five trials of the second session were 
recall trials. During the five recall trials the 
lists were presented in exactly the same way 
as during learning except that the responses were 
not presented and there was no indication to S
whether a response was right or wrong. The 
Ss were told that the responses would not appear 
for a few trials but that they should continue 
to spell as many responses as possible each time 
through the list. After the five recall trials 
the responses were again presented in the 
usual fashion and relearning continued to a 
total of 25 trials for both sessions or to a criterion 
of three successive perfect trials. For purposes 
of the analysis it was assumed that all responses 
would be correct after three successive perfect 
trials. 


RESULTS


Learning.—The mean number of
correct responses on Trial 25 of each
list is given in Table 2 and the
analysis of variance is presented in
Table 3. The college students are
superior to the high school students,
but there is no significant interaction
of students with the other variables.
The two groups are combined in
Fig. 1 and the mean number of cor-
rect responses on each trial is pre-
sented for each list. The first break
in each curve indicates the 48-hr.
retention interval, and the second the
end of the five recall trials when the
responses were again presented.
Number of different responses is
highly significant, but this analysis
does not take into account the dif-
ferential probability of guessing the
correct response. The appreciable
number of times Ss did not respond
makes the value of a correction for
guessing doubtful. However, the cor-
rection used by Bricker (2) was ap-
plied to Trial 25 and an analysis of
variance was computed on the cor-
rected scores. This decreased the F
for number of responses to .97 and
reduced the F for the interaction
between number of responses and
similarity to 2.98. The F’s for Simi-
larity and for Students increased to
17.51 and 8.82, respectively. This
correction probably overestimates the
effect of guessing, but there are
other indications that number of
responses may not be an effective
variable in learning. In Fig. 1 it is
apparent that the performance level
expected from guessing alone is at-
tained in approximately the same
number of trials for lists with different
numbers of responses and that the
differences between the 2-, 4-, and
8-response lists tend to decrease after
this point. There is also a significant
interaction between number of re-
sponses and similarity with the small-
est differences between the low-
similarity lists. In view of this, the
significant effect of number of re-
sponses is attributed to guessing and
to similarity relationships.


TABLE 2
MEAN NUMBER OF CORRECT RESPONSES ON TRIAL 25

              College Students          High School Students
              No. of Responses          No. of Responses
Similarity      2       4       8         2       4       8
Hr            12.08    9.17    8.17     11.25    8.50    8.92
L             14.17   12.00   10.92     11.25   11.17   11.50
Ha            14.17   15.33   12.00     13.67   13.17    7.83
Combined      13.47   12.17   10.36     12.06   10.94    9.42


TABLE 3
ANALYSIS OF VARIANCE FOR THE NUMBER OF CORRECT RESPONSES ON TRIAL 25

Source                    df      MS        F
Students (St)              1     77.04     7.48*
No. of Responses (NR)      2    150.04    14.58*
Similarity (Sm)            2    173.51    16.86*
NR × Sm                    4     37.83     3.67*
St × Sm                    2     18.76     1.82
St × NR                    2      1.01      .10
St × NR × Sm               4     21.17     2.06
Within                   198     10.29

*P = .01.


Fig. 1. Mean number of correct responses
on each list as a function of trials.

The over-all effect of similarity 
is highly significant with the high 
interresponse stimulus similarity lists 
being more difficult, and the high 
intraresponse stimulus similarity lists 
less difficult, than the low-similarity 
lists. However, the significant inter- 
action indicates that the effect of 
similarity is not independent of the 
number of responses. Similarity as 
it is used here is a dimension only if 
considered in relation to the responses. 
If the number of stimuli remains con- 
stant then the relationships in the 
Ha and Hr lists must change as the 
number of responses change. If we 
consider intralist transfer as a function 
of the similarity of the stimuli then 
there would be little or no transfer 
in the low-similarity lists while each 
stimulus in the high-similarity list 
would produce transfer to three 
others. The direction of transfer 
would depend on whether the three 
stimuli had the same or different 
responses as the stimulus which is 
producing the transfer. Thus the
high-similarity lists would have either
negative transfer to three items within
a similarity group (Lists 4-Hr and
8-Hr), positive transfer to one and
negative to the other two (Lists
2-Hr and 8-Ha), or positive transfer
to three items (Lists 2-Ha and 4-Ha).
In Fig. 2 the number of correct re-
sponses on Trial 25 is plotted as a
function of the amount of transfer
expected. The abscissa indicates the
number of items, within a single
similarity group, to which transfer
is expected to be positive or negative.
There is a separate curve for each of
the three different numbers of responses
and the similarity is indicated at
each point. It is clear that the num-
ber of correct responses increases
as the number of items to which
positive transfer is expected increases.
This effect is not dependent on the
number of different responses in the
list.


Fig. 2. Mean number of correct responses
on Trial 25 as a function of the number of items
within a similarity group to which the transfer
would be positive or negative.
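The transfer counts used for the abscissa of Fig. 2 follow from the response assignments in Table 1. The following is a minimal sketch, assuming the six high-similarity response columns of Table 1 and similarity groups made up of four consecutive items; the names `LISTS` and `transfer_profile` are hypothetical and are used only for illustration.

```python
# Response assignments for the six high-similarity lists, items 1-16 in order
# (transcribed from Table 1; the dictionary layout itself is an assumption).
LISTS = {
    "2-Ha": ["BIH"] * 8 + ["ZET"] * 8,
    "2-Hr": ["BIH", "ZET"] * 8,
    "4-Ha": ["BIH"] * 4 + ["VOM"] * 4 + ["RUK"] * 4 + ["ZET"] * 4,
    "4-Hr": ["BIH", "VOM", "RUK", "ZET"] * 4,
    "8-Ha": [r for r in ["BIH", "SUJ", "VOM", "NAC", "RUK", "YAD", "ZET", "QIX"]
             for _ in range(2)],
    "8-Hr": ["BIH", "SUJ", "VOM", "NAC"] * 2 + ["RUK", "YAD", "ZET", "QIX"] * 2,
}


def transfer_profile(responses, group_size=4):
    """For each item, count the other items in its similarity group that share
    its response (positive transfer) and that do not (negative transfer).
    Returns the set of distinct (positive, negative) pairs found in the list."""
    profiles = set()
    for i, response in enumerate(responses):
        start = i - i % group_size
        others = [responses[j] for j in range(start, start + group_size) if j != i]
        positive = sum(other == response for other in others)
        profiles.add((positive, len(others) - positive))
    return profiles


for name, responses in LISTS.items():
    print(name, sorted(transfer_profile(responses)))
```

Every item within a given list has the same profile, so each list reduces to a single (positive, negative) pair, which is the quantity the text assigns to it.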

Retention.—In Fig. 1 it is apparent 
that there was an appreciable loss 
on all lists from the last learning 
trial to the first recall trial and, 
even though Ss were given no in- 
formation as to the correctness of 
their responses, there was a gain 
over the five recall trials on all except 
List 2-Hr. For a statistical evalua- 
tion a count was made of the num- 
ber of Ss who had more correct 
responses and the number of Ss who 
had fewer correct responses on the 
second of the two trials being com-
pared. Computation of χ²’s indi-
cated that a significant number de-
creased over the 48-hr. retention
interval, increased from Recall 1 to 
Recall 5, and that there was not a 
significant number of Ss who de- 
creased from the last learning trial 
to the fifth recall trial. There was a 
tendency for more loss over the reten- 
tion interval in the low-similarity 
lists but there was also more gain in 
these lists over the five recall trials.
There were no systematic differences
in retention as a function of stimulus
similarity relationships with the re-
sponses. The percentage of the num-
ber correct on the last learning trial
which was correct on the first and
fifth recall and the first relearning
trial is given for each list in Table 4.
It should be noted that retention is
uniformly high and that it increases
from Recall 1 to Recall 5.


TABLE 4
PERCENTAGE OF RETENTION AFTER 48 HR.

Recall 5    Relearn 1
  87.2        100.5
  95.6        102.0
  89.0         98.4
 117.8        105.1
 106.4         98.1
 104.3        102.4
  93.3         92.0
  98.6         96.6
  92.8         98.9

Errors.—The errors on the last
learning trial, the first and fifth recall 
trials, and the first and second re- 
learning trials were classified as omis- 
sions (failure to make an overt re- 
sponse), errors from within the list, 
or others which included partial 
responses and combinations of letters 
which were not correct for any item 
in the list. There were no systematic
differences in the distribution of
types of errors as a function of
similarity. Although there were many
more errors in the Hr groups, the
percentage of these that were from
within the list was no greater than
in the Ha groups. The numbers of
different types of errors in the five
trials for all lists combined are given
in Table 5. The increase in correct
responses over the five recall trials
is primarily the result of the decrease
in the number of omissions.


TABLE 5
TOTAL NUMBER OF ERRORS IN EACH CLASSIFICATION FOR ALL LISTS COMBINED

433
1130

244 96
1807 1786


Discussion 


It is apparent from Fig. 2 that rote- 
learning transfer principles predict the 
relative speed of learning when similarity 
relationships are varied in relation to the 
responses. The higher the similarity 
between stimuli with the same response 
the faster the learning, and the higher 
the similarity between stimuli with 
different responses the slower the learn-
ing. The positive intralist transfer com-
monly found in concept learning can be
produced by manipulating stimulus simi-
larity. List 4-Ha is most comparable
to the concept lists since there is one 
response for each group of similar stimuli. 
It is here that the difference between 
the Ha list and the L list is greatest. 
Retention does not seem to be related 
to similarity and this agrees with other 
studies in which stimulus similarity 
has been varied (14). However, the 
retention is high. If one considers only 
the 4-Ha list, there is 94.9% retention 
on the first recall trial and this increases 
to 104.3% on the fifth recall trial. 
Differences in number of items, degree 
of learning, retention interval, and 
procedure prohibit direct comparison 
with concept studies (10, 11, 12), but
the retention of concepts is not obviously
better. In view of the high degree of
retention in this study it seems that a
direct comparison of concept and rote 
materials which used the same procedure 
and equated the degree of learning might 
show no differences in retention. 

The five recall trials during which the 
responses were not presented resulted
in an increase in the number of correct 
responses on all lists except 2-Hr and the 
small increase in the 2-response lists 
may be due to the fact that the per- 
formance at the end of learning was at 
approximately the guessing level. This 
procedure seems similar to warm-up 
before recall found by Irion (8) but 
differs in that it is practice on the same 
task rather than some other task. On 
the other hand, it cannot be considered 
learning in the usual sense because there 
was no information given as to the 
correctness of the response. In Table 
4 there seems to be little relation of the 
percentage of increase over the five 
recall trials with either similarity or 
number of responses. The increase is 
definitely related to the number of trials 
on recall, but the limit cannot be deter- 
mined from this experiment. Further 
experimentation is required to relate 
this phenomenon to warm-up and to 
determine what aspects of the procedure 
are necessary to produce it. 


SUMMARY 


This study varied the number of different 
responses and the similarity of the stimuli in 
relation to the responses in paired-associate 
lists. In 16-item lists either 2, 4, or 8 nonsense 
syllables were used as responses and either high- 
or low-similarity adjectives were used as stimuli. 
The similarity of the stimuli which had the same 
response was varied. 

It was found that similarity was a variable 
in learning and that the list with a single re- 
sponse for each group of similar adjectives 
showed high positive intralist transfer charac-
teristic of concept learning. Retention did
not seem to be related to similarity, but was 
uniformly high. 
The well known fact that meaning-
ful verbal material is easier to memo-
rize than nonsense material is usually
explained by saying that meaningful 
material permits more positive trans- 
fer from previous learning; S has 
already learned a great deal about the 
meaningful material before the experi- 
ment begins. Exactly what the S 
has already learned that he can 
transfer to the memorization of mean- 
ingful material is usually not speci- 
fied, however. The positive transfer 
may be attributable to something 
about meaning per se that makes the 
task easier, or it may be attributable 
to the fact that meaningful verbal 
material usually obeys certain familiar 
rules of formation, or both. In order 
to study this question, Miller and 
Selfridge (5) used statistical approxi- 
mations to English which Ss were 
required to reproduce after one hear- 
ing; they came to the conclusion that 
familiarity with the rules which 
govern the sequential properties of 
meaningful passages is probably much 
more important in facilitating recall 
than is S’s understanding of the 
passage. This conclusion implies 
that we should take a closer look 
at the effects of nonrandom rules of 
formation. 

Aborn and Rubenstein (1, 6) initi- 
ated such an exploration of the effects 
of rules in a series of studies that 
used sequences of nonsense syllables. 


1 This research was supported under Contract 
AF 33(038)-14343 and appears as report number 
AFCRC-TN-58-50 (ASTIA Document Number 
AD 146780). Reproduction for any purpose 
of the U. S. Government is permitted. The 
author is indebted to Elizabeth A. Friedman for 
the collection of the data. 


The sequences were constructed ac- 
cording to rules that varied from 
completely random to highly deter- 
ministic. In the terminology of in- 
formation theory (7), the sequences 
formed according to elaborate and 
restrictive rules were more redundant, 
carried less information per syllable, 
than the random sequences. It was 
found that, for Ss familiar with the 
rules of formation, the redundant 
sequences were easier to memorize, 
but not as much easier as would be 
predicted on the hypothesis that 
the amount of information acquired 
is invariant under changes in the 
rules of formation. These results
confirm and extend the conclusion
drawn by Miller and Selfridge and
do so in a situation where meaningful
words are eliminated and E has 
complete knowledge of the rules of 
formation and of S’s prior experience 
with those rules. 

The particular sets of rules used by 
Aborn and Rubenstein were of a 
simple type which Chomsky (2) has 
referred to as “‘finite state grammars.” 
The grammar of English cannot be 
stated in such simple terms, but 
finite state systems are quite con- 
venient for the experimental study 
of the effects of rules on memoriza- 
tion. Chomsky and Miller (3) have 
described the formal properties of 
finite state languages and their re- 
sults are used below to describe the 
materials in the present study. In 
spite of their formal simplicity, these 
finite state systems pose some com- 
plicated psychological problems. We 
would like to know what changes, if 
any, in a set of rules leave performance 
invariant, and so we are faced with
a large and tangled set of possible 
relations between the formal aspects 
of the finite state systems and the 
empirical aspects of S’s behavior. 
The present study does little more 
than illustrate the complexity of this 
formidable task. 


CHARACTERIZATION OF REDUNDANT 
STRINGS 


All strings of symbols used in these experi-
ments were composed of four letters of the
alphabet (G, N, S, and X) chosen randomly
from the consonants. With these four letters
it is possible to form 4^λ different strings of letters
of Length λ. A finite state generator is an
algebraic system that selects a particular subset, 
the well-formed strings, out of the set of all 
possible strings (2, 3). The particular finite 
state generator used here is displayed in graphical 
form in Fig. 1, where the numbered circles 
represent the states of the system, the arrows 
represent the permissible transitions from state 
to state, and the letters associated with each 
arrow represent the symbols generated in 
moving from one state to the next. All well- 
formed strings are generated by starting in 
State 0 and following any path through the 
diagram that ends on the first visit to State 0’. 
Thus, the generator selects the well-formed
strings: NG, SSG, NNSG, SSXG, SXSG,
NNSXG, SXXSG, etc. At the bottom of Fig. 1
the rules of transition are stated explicitly;
for example, (0, N, 1) means that when the sys-
tem is in State 0 it can generate the symbol N
and thus move next to State 1. A state is
defined as the set of all rules applicable in that
state; for example, State 2 consists of two rules,
(2, G, 0’) and (2, X, 1).


RULES OF TRANSITION:

(0, N, 1)    (1, N, 3)     (2, G, 0’)    (3, S, 2)
(0, S, 3)    (1, G, 0’)    (2, X, 1)     (3, X, 3)


Fig. 1. Diagram of the finite state generator
of the redundant strings used in this experiment.
An admissible string is any sequence of letters
generated by starting in State 0 and ending on
the first arrival at State 0’. An equivalent,
algebraic description of the generator is given
in the rules of transition beneath the diagram;
each rule gives the state of the system in which
the rule is applicable, the symbol generated in
transition, and the state in which the system
is left after the rule is applied.
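The generator described above can be run mechanically to list its well-formed strings. The following is a minimal sketch, assuming the eight transition rules given with Fig. 1; the function name `well_formed` and the stack-based search are illustrative choices, not part of the original study.

```python
from collections import defaultdict

# Transition rules (state, symbol, next state), as listed with Fig. 1.
RULES = [
    ("0", "N", "1"), ("0", "S", "3"),
    ("1", "N", "3"), ("1", "G", "0'"),
    ("2", "G", "0'"), ("2", "X", "1"),
    ("3", "S", "2"), ("3", "X", "3"),
]


def well_formed(max_len):
    """Return every string of length <= max_len generated by starting in
    State 0 and stopping on the first arrival at State 0'."""
    moves = defaultdict(list)
    for state, symbol, nxt in RULES:
        moves[state].append((symbol, nxt))
    found = []
    stack = [("0", "")]
    while stack:
        state, prefix = stack.pop()
        if state == "0'":
            found.append(prefix)      # terminal state reached: string is complete
            continue
        if len(prefix) == max_len:
            continue                  # too long to finish within the limit
        for symbol, nxt in moves[state]:
            stack.append((nxt, prefix + symbol))
    return sorted(found, key=lambda s: (len(s), s))


strings = well_formed(7)
print(len(strings))   # 22
print(strings[:5])    # ['NG', 'SSG', 'NNSG', 'SSXG', 'SXSG']
```

The count of 22 strings of Length 7 or less agrees with the number given in the Method section.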

For this particular generator, the number F(λ)
of strings of Length λ or less can be computed
recursively from the equation

$$F(\lambda) = 2F(\lambda - 1) - F(\lambda - 2) + F(\lambda - 3) - F(\lambda - 4),$$

where F(1) = 0, F(2) = 1, F(3) = 2, and F(4) = 5.²
The largest real root of this finite difference
equation is 1.4656, and for large λ, F(λ) ap-
proaches 1.731(1.4656)^λ. If we use Shannon’s
(7) measure of channel capacity, this system
has a capacity of

$$\lim_{\lambda \to \infty} \frac{\log_2 F(\lambda)}{\lambda} = \log_2 1.4656 = 0.552 \text{ bit/letter}.$$

By comparison, the set of all strings formed on
this set of four letters has a capacity of
log₂ 4 = 2 bits/letter. Thus, the redundancy is
1 − (0.552/2) = 0.724, or 72.4%. This can be
interpreted to mean that the well-formed strings
are 3.62 times as long as they would need to be
in order to encode the same amount of infor-
mation.
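The counts, the root, and the capacity quoted above can be reproduced numerically. The sketch below is an illustration only: it assumes the recurrence for strings of exactly a given length, f(n) = f(n − 1) + f(n − 3), whose characteristic equation is x³ = x² + 1, and it locates the root by bisection, a method chosen here for simplicity and not taken from the source. All function names are hypothetical.

```python
import math


def exact_counts(max_len):
    """f[n]: number of well-formed strings of exactly length n."""
    f = {1: 0, 2: 1, 3: 1, 4: 3}
    for n in range(5, max_len + 1):
        f[n] = f[n - 1] + f[n - 3]
    return f


def cumulative_counts(max_len):
    """F[n]: number of well-formed strings of length n or less."""
    f = exact_counts(max_len)
    total, cumulative = 0, {}
    for n in range(1, max_len + 1):
        total += f[n]
        cumulative[n] = total
    return cumulative


def largest_root(lo=1.0, hi=2.0, steps=60):
    """Bisection for the real root of x**3 - x**2 - 1 = 0 between lo and hi."""
    def g(x):
        return x ** 3 - x ** 2 - 1
    for _ in range(steps):
        mid = (lo + hi) / 2
        if g(mid) > 0:
            hi = mid
        else:
            lo = mid
    return (lo + hi) / 2


F = cumulative_counts(7)
root = largest_root()
capacity = math.log2(root)                 # bits per letter
redundancy = 1 - capacity / math.log2(4)   # relative to 2 bits per letter
print(F[7])  # 22
print(root, capacity, redundancy)
```

The root comes out close to 1.4656, the capacity close to 0.55 bit per letter, and the redundancy close to 72.4%, in line with the figures in the text.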

It is possible to show that such a finite state
generator can be decomposed, or “factored,”
into a set of simpler generators. In the present
example, we first find all those paths between
States 0 and 0’ which visit no state more than
once. These paths generate the basic strings
NG, NNSG, SSG, and SSXG. All well-formed
strings can be formed by inserting loops at
appropriate points in these four basic strings.
We symbolize a loop by placing it in parentheses,
where the second parenthesis indicates that we
have returned to the same state we were in at
the first parenthesis. Using this notation, we
can summarize the entire set of well-formed
strings: N(N(X)SX)G, N(N(X)SX)NSG,
S((X)(SXN))SG, and S((X)(SXN))SXG. This
method of representing the well-formed strings
is more compatible with the way Ss tend to
organize them than are the representations in
terms of graphs or transition rules. It should
be noted that in this particular generator, the
basic strings themselves seem to group together;
NG is included in NNSG and SSG is included in
SSXG, and the Ss seemed to distinguish only
two basic types of strings. The S who notices
these similarities among the strings can use them
to reduce the amount of information he is
required to remember.


2 The number f(λ) of strings of exactly length
λ can be similarly computed from the equation
f(λ + 3) = f(λ + 2) + f(λ), for λ = 2, 3, . . .,
given f(1) = 0, f(2) = f(3) = 1, and f(4) = 3.
Obviously, f(λ + 1) = F(λ + 1) − F(λ). The
amount of information per letter computed for
F(λ) is the same as that computed for f(λ).
(Cf. 3, p. 111.)


TABLE 1

LISTS OF STRINGS USED

Redundant Strings       Random Strings
L1         L2           R1         R2
           NNXXSXG      GSXXGNS    NGSXXNS
SXSXNSG    NNXXXSG      NSXXGSG    NGXXGGN
SXXXSXG    SSXNSXG      SGXGGNN    SXGXGNS
SXXXXSG    SSXNXSG      XXGNSGG    XGSNGXG


METHOD


The generator in Fig. 1 gives 22 well-formed 
strings of Length 7 or less. Of these 22, 18 
were chosen in such a way that two lists of nine 
strings, each containing the same number of 
strings of the same length, could be constructed. 
The assignment of a string to one list or the 
other was determined by a table of random 
numbers. The result of this procedure was the 
two lists, L1 and L2, in the first two columns of
Table 1.

In addition, Table 1 contains two further
lists, R1 and R2, of nine strings composed of the
same four letters. Lists R1 and R2, however,
were constructed with the aid of a table of ran- 
dom numbers. The only constraint in their 
formation was the requirement that the R 
lists should have the same number of strings of 
each length as did the L lists. 
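The construction of a random list under this single constraint can be sketched as follows. Several things here are assumptions rather than facts from the source: a seeded pseudo-random generator stands in for the table of random numbers, the length profile (one string of Length 4, two of Length 5, two of Length 6, and four of Length 7) is inferred from the counts of well-formed strings rather than stated in the text, and duplicate strings within a list are excluded. The names are hypothetical.

```python
import random

LETTERS = "GNSX"
# Assumed number of strings of each length in one nine-string list
# (inferred, not stated explicitly in the text).
LENGTH_PROFILE = {4: 1, 5: 2, 6: 2, 7: 4}


def random_list(profile, seed=None):
    """Draw strings of the four letters at random, matching only the
    number of strings required at each length."""
    rng = random.Random(seed)
    strings = []
    for length, count in sorted(profile.items()):
        chosen = set()
        while len(chosen) < count:   # assumption: no repeated strings in a list
            chosen.add("".join(rng.choice(LETTERS) for _ in range(length)))
        strings.extend(sorted(chosen))
    return strings


r1 = random_list(LENGTH_PROFILE, seed=1)
print(r1)
```

Apart from the length profile, nothing restricts the letter sequences, so strings drawn this way will generally not be well-formed under the generator of Fig. 1.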

These strings were printed in block letters 
on 3 × 5-in. cards. The S was seated across
the table from E, who laid the cards face up in
front of S at a rate of one card every 5 sec.
After all nine cards in the list had been pre-
sented, E picked up the deck of cards and
shuffled it while S wrote, in any order, all the
strings he could recall. When S finished, E
removed the response sheet and proceeded to the
next trial. Ten such trials were given and the 
order of the cards was varied haphazardly on 
each trial. Each S studied two lists, and at 
the end of the tenth trial on the second list he 
wrote all the strings he could recall from the 
first list. 


The Ss were undergraduate and graduate
students at Harvard and Radcliffe Colleges 
and members of the staff of the Harvard Psy- 
chological Laboratories. The original design 
called for two Ss each to learn the lists L1L2,
L2L1, R1R2, and R2R1, in that order, and one S
each to learn L1R1, L1R2, L2R1, L2R2, R1L1, R1L2,
R2L1, and R2L2, which would have given four Ss
on each type of pair: LL, RR, LR, and RL. 
The results with the RL condition were so 
variable, however, that it was run three times, 
giving 12 Ss in all for this condition. 

The Ss were told nothing about the method 
of constructing the lists. The same explanation 
of the procedure to be followed was given to 
every S: he was to read the nine cards and write 
down as many strings as he could remember 
on each trial. 

The response sheets were scored for the 
number of correct strings. Incorrect responses 
were ignored. 


RESULTS 


The mean number of correct responses is shown in Fig. 2 as a
function of the trial number. The plot on the left gives the
mean number correctly recalled by the eight Ss who learned an
L list first (open circles) and also by the 16 Ss who learned
an R list first (solid squares). The plot on the right in
Fig. 2 gives the mean number correct on the second list for
each of the four groups.

[Figure 2. Panel title: First List. Vertical axis: mean number correctly recalled. Horizontal axis: successive trials.]

FIG. 2. Mean number correct as a function of the trial number.
L indicates a redundant list and R indicates a random list. On
the left, the curve labeled L. represents the results on the
first list for the eight Ss who studied lists LL or LR, in that
order, and the curve labeled R. represents the 16 Ss who
received RL or RR. On the right, the curves labeled LL, LR, and
RR are based on data for the second list studied by four Ss
each; the RL function is based on 12 Ss.

FIG. 3. Individual data for eight Ss learning a list of
redundant strings initially are given in the graph on the left.
On the right, the individual data are given for the 12 Ss who
studied the redundant strings after they had previously studied
a list of random strings.

Averaging data from several Ss 
gives a poor picture of what happened
in the RL condition. Figure 3 shows
individual data: the left plot shows 
the data for the 8 Ss who learned an 
L list first, and the right plot shows 
the data for the 12 Ss who learned an 
L list after they had studied an R 
list. The effect of prior study of an 
R list is to increase the variance on 
the L list. 

When Ss were asked, after study- 
ing a second list, to recall the first 
list they had studied, the average 
number of strings recalled correctly 
was less than the number recalled 
on the tenth trial with the first list. 
In the LL condition, the average 
recall dropped from 8.75 strings on 
the last trial with the first list to 5.0 
strings on the retest. In the LR 
condition it dropped from 9.0 to 5.0. 
In the RL condition, it dropped from 
3.33 to 1.83. And in the RR condi- 
tion it dropped from 4.25 to 0.5. 


DISCUSSION


It is clear that the redundant strings 
were learned more quickly than the 
random strings. One S in the LR
group expressed the difference when he 
commented after the experimental ses- 
sion, “The first list was like studying, 
but the second was like cramming.” The 
fact that Ss were organizing the strings 
into related clusters could be seen in 
the serial order of their recall on the 
later trials with the L lists. 

It is often assumed, at least for nonsense materials, that
associative interference will increase as the items to be
learned are made more similar. This assumption may not be
valid when the method of free recall is used, since Underwood
(9) has shown that recall and intralist similarity are not
systematically related. In the present experiment the
redundant lists have greater intralist similarity than the
random lists,* yet the similarities are an advantage rather
than a handicap. The Ss avoided interference due to stimulus
generalization by recoding the redundant strings. For example,
List L2 contains NNSG, NNSXG, NNXSXG, NNXXSG, NNXXSXG, and
NNXXXSG. Given NNSG, these six strings are easily recoded as
00, 01, 11, 20, 21, and 30, where the two digits indicate the
number of X’s preceding and following the S. Then SSXNSXG and
SSXNXSG can be learned together, since one is derived from the
other by a simple inversion of letters. That leaves SXXSG as
the ninth string, the only one that doesn’t “fit” with any
other. By such grouping and recoding devices as this it is
possible to reduce the difficulty of the task. Similarity of
the strings is not a handicap if S does not choose to remember
the strings in the form in which they are presented, but
recodes them and remembers the coded form instead.

* The greater intralist similarity of the redundant strings was
demonstrated by a search task. Lists of 30 and 60 redundant
strings and 30 and 60 random strings were typed on separate
sheets of white paper and 10 Ss were asked to search for
particular strings. With 30 alternatives, the median search
time was 15.5 sec. for redundant strings, but only 9.3 sec. for
random strings. With 60 alternatives, the corresponding medians
were 33.0 and 15.5 sec. It took about twice as long to search
through the redundant strings, and it is in this sense that one
can say the redundant strings have a greater degree of
similarity. We are indebted to Sherry Blumenthal for collecting
these data. It is interesting to note that Fitts, Weinstein,
Rappaport, Anderson, and Leonard (4) also found that the
recognition of redundant visual patterns was slower than the
recognition of random patterns in a search task.
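
The recoding of the six NN-strings described in the Discussion (00, 01, 11, 20, 21, 30) can be made concrete with a short sketch. This is an illustration only: the function name `recode` and its parameters are hypothetical, and nothing here is claimed about how any S actually performed the recoding. The sketch assumes each string has the form NN, some X's, an S, some X's, and a final G.

```python
def recode(string, prefix="NN", suffix="G", marker="S"):
    """Recode a string such as 'NNXXSXG' as two digits:
    the number of X's before the S and the number of X's after it."""
    if not (string.startswith(prefix) and string.endswith(suffix)):
        raise ValueError(f"{string!r} does not have the assumed NN...G form")
    core = string[len(prefix):len(string) - len(suffix)]
    before, found, after = core.partition(marker)
    if not found or set(before + after) - {"X"}:
        raise ValueError(f"{string!r} is not NN + X's + S + X's + G")
    return f"{len(before)}{len(after)}"


nn_strings = ["NNSG", "NNSXG", "NNXSXG", "NNXXSG", "NNXXSXG", "NNXXXSG"]
codes = [recode(s) for s in nn_strings]
```

Six strings of four to seven letters each are thus reduced to six two-digit codes plus one rule, which is the sense in which recoding reduces what has to be remembered.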

If we consider the quantities of in- 
formation involved, we can ask whether 
the amount of information per trial is 
approximately the same for the redun- 
dant and the random strings. This 
would be a surprising result in any case, 
but especially in this experiment where 
Ss had no prior knowledge of the rules 
underlying the redundant strings and 
could scarcely be expected to exploit 
them in an optimal fashion. The question
is easily tested with the data, however.
With random strings, the average
S learned 3.63 strings on his first 10 
trials and these strings contained an 
average of 20.5 letters. At 2 bits/letter, 
this corresponds to 41 bits of information, 
or 4.1 bits/trial for the random strings. 
With redundant strings, on the other 
hand, the average S learned all nine 
strings, or 54 letters, in his first 10 trials; 
at 0.552 bits/letter, this amounts to 
29.8 bits, or approximately 3 bits/trial. 
These values, 4.1 and 3 bits/trial,
actually underestimate the difference, 
since the computation for the random 
strings ignores the information S may 
have acquired about the strings he was 
unable to reproduce perfectly. This 
result confirms the conclusion of Aborn 
and Rubenstein (1, 6), who also found 
that with highly redundant strings Ss 
learn more symbols but less information 
than with random strings. 
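
The arithmetic in this paragraph can be retraced with a few lines of Python. This is a minimal sketch using only the figures quoted above; the function name `bits_learned` is illustrative.

```python
def bits_learned(letters, bits_per_letter, trials):
    """Return (total bits, bits per trial) for a given number of letters learned."""
    total = letters * bits_per_letter
    return total, total / trials


# Random strings: 20.5 letters at 2 bits/letter over 10 trials.
random_total, random_rate = bits_learned(20.5, 2.0, 10)

# Redundant strings: 54 letters at 0.552 bits/letter over 10 trials.
redundant_total, redundant_rate = bits_learned(54, 0.552, 10)
```

The random strings give 41 bits (4.1 bits/trial) and the redundant strings about 29.8 bits (roughly 3 bits/trial): more letters learned, but less information.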

In order to explore this question of 
information storage in more detail, 
eight Ss learned a list of nine random 
strings composed of G, N, S, and X 
where the total amount of information 
in the new set of nine random strings 
was approximately the same as the 
total amount of information in the set
of nine redundant strings. The total
amount of information involved in the
redundant strings was 29.8 bits, so the
new random strings had to be composed
of 15 letters at 2 bits/letter in order to
give the equivalent 30 bits of informa- 
tion. These 15 letters were arranged 
in three strings of Length 1 and six 
strings of Length 2. The experimental 
procedure was the same as before. 
The average number correctly recalled 
on successive trials was 6.0, 7.6, 8.0, 8.0, 
8.3, 8.5, 8.5, 8.6, 8.6, and 8.6 strings. 
If, in order to estimate the amount 
of information stored per trial, we assume 
that in three trials the average S had 
learned eight strings, or 26 bits, then 
we obtain an estimate of about 8.7
bits per trial. This is a crude estimate, 
but even so it is clear that the amount 
of information acquired per trial is not 
a constant independent of the way the 
information is encoded. The shorter 
random strings are more easily memo- 
rized and provide a more efficient method 
of storing information. 

The total amount of information that 
S acquires in a series of trials with a 
given list can be broken up into at least 
four factors. The total amount of 
information acquired is equal to the 
number of letters learned times the 
average number of bits of information 
per letter. The number of letters 
learned is approximately equal to the 
number of strings correctly recalled 
times the average length of a string. 
The number of bits/letter is equal to 
the maximum possible bits/letter multi- 
plied by a factor that depends upon the 
redundancy of strings. Thus the four 
factors involved are the number of 
letters per string, the number of strings, 
the size of the alphabet used, and the 
redundancy of the strings. Symbolically, the amount of
information S has acquired on any given trial is not less than

$$(1 - R)\,\log_2 m \sum_{i=1}^{N} \lambda_i s_i ,$$

where m is the size of the alphabet used, R is the redundancy
of the strings, λ_i is the number of letters in the ith string,
s_i is a function that assumes the value 1 if String i is
recalled and 0 if String i is not recalled, and N is the number
of strings. The independent variables under E’s control are m,
R, N, and λ_i, in addition to the number of the trial, and the
dependent variable is s_i, the strings which S recalls.
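
The lower bound just defined can be evaluated directly. The sketch below is illustrative only: the function name is hypothetical, and the redundancy value is not stated in the text but is derived here from the 0.552 bits/letter and the 2 bits/letter maximum quoted earlier in the Discussion (R = 1 − 0.552/2). The nine strings are those named in the Discussion.

```python
import math


def information_lower_bound(strings, recalled, alphabet_size, redundancy):
    """(1 - R) * log2(m) * sum(lambda_i * s_i), in bits.

    strings  -- the N strings of the list (lambda_i is each string's length)
    recalled -- s_i: 1 if String i was recalled, 0 otherwise
    """
    if len(strings) != len(recalled):
        raise ValueError("need one recall indicator per string")
    letters = sum(len(s) for s, r in zip(strings, recalled) if r)
    return (1.0 - redundancy) * math.log2(alphabet_size) * letters


redundant_list = ["NNSG", "NNSXG", "NNXSXG", "NNXXSG", "NNXXSXG",
                  "NNXXXSG", "SSXNSXG", "SSXNXSG", "SXXSG"]

# Derived assumption: 0.552 bits/letter out of a possible 2 bits/letter.
R_REDUNDANT = 1.0 - 0.552 / 2.0

bits_all_recalled = information_lower_bound(redundant_list, [1] * 9, 4, R_REDUNDANT)
bits_one_random = information_lower_bound(["GSXXGNS", "NGSXXNS"], [1, 0], 4, 0.0)
```

With all nine redundant strings recalled (54 letters) the bound is about 29.8 bits; a single recalled random string of seven letters contributes 14 bits.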

With random strings, the evidence 
available indicates that m, the number 
of different letters, has relatively little 
effect on the number of trials required 
to reach a given criterion of mastery 
(8). R is, of course, zero for random 
strings, so the important variables are 
λ_i and N, the length of the strings and
the length of the list of strings to be 
memorized. With redundant strings, 
on the other hand, the task may actually 
become easier as m increases, since the 
rules underlying the redundancy usually 
become more obvious. As R increases,
learning will become easier for any S
who takes advantage of the redundancy,
but the conditions which enable him to
use it in order to simplify his task
remain to be explored. The effect of
increasing N when the strings are
redundant should be far less than when 
they are random, since the strings fall 
into certain recognizable classes defined 
by the basic strings and the number 
of basic strings does not vary as a function
of N. The effect of λ_i is not obvious.
It would be interesting to keep
the total number of letters constant
while N and λ_i varied inversely.

A principal question in such an analy- 
sis, however, is whether the measure of 
redundancy, R, is adequate to represent 
all the different kinds of redundancy 
that might lead to the same value of 
R. With respect to the finite state 
generators which determine the redun- 
dancy of the strings, the major vari- 
ables to be explored include the number 
of different letters used, the number of 
states, the number of rules of transition, 
and the number of basic strings. 

This quick glance at the parametric 
aspects of redundant strings makes their 
exploration look rather formidable, especially
when we remember the large
number of parameters we already have
in studies of verbal learning. The 
prospect is further complicated by the 
necessity of exploring the conditions 
that facilitate the efficient discovery 
and use of the redundancies in such 
strings. This question raises problems 
more nearly akin to concept formation 
than to the usual studies of verbal 
learning. Although some psychologists 
seem to hold the opinion that concept 
formation is merely a complicated kind 
of verbal learning, the present analysis 
suggests the converse—namely, that 
verbal learning is merely a simple form 
of concept formation. In either view, 
however, it seems likely that the same 
problems will arise in both areas and 
that one will not progress without the 
other. 

The design of the present experiment 
is appropriate for the study of transfer 
effects in verbal learning. In Fig. 2 
there is apparent a positive transfer for 
the LL, LR, and RR conditions; the 
improvement may be simply another 
case of the well known “warm-up effect.” 
The RL condition, however, shows nega- 
tive transfer and Fig. 3 indicates that 
this negative transfer is due to a large 
increase in the variance between Ss. 
In the terminology of verbal learning 
experiments we should probably call 
this result a negative “transfer of 
principles,” since the prior learning of a 
random list apparently caused some Ss 
to accept the “principle” that the new 
task was also random. It is just as 
natural, however, to speak of an Einstel- 
lung effect, or of functional fixedness. 
The strategy that Ss adopted for memo- 
rizing redundant strings apparently did 
not handicap them when they later 
encountered random strings, but the 
reverse was not true: The strategy 
adopted for the random strings was not 
efficient when later applied to redundant 
strings. 

However, the Einstellung effect sug- 
gested by the increased variance in the 
scores obtained on redundant strings 
following prior experience with random 
strings may be peculiar to the particular 
finite state generator used in this experiment.
Subsequent studies with other
generators have not shown this disrup- 
tion so clearly. Until we have a better 
conception of how Ss attempt to organize 
redundant strings, therefore, it is un- 
wise to overgeneralize this feature in the 
results. 


SUMMARY 


The present experiment explores the recall 
of redundant, as opposed to random, strings 
of symbols. First, an algebra for generating 
strings of symbols according to explicit rules is 
defined and a set of redundant strings is gener- 
ated. A subset of these strings is memorized 
by S, using the method of free recall. As a 
control, the same procedure is used for equiva- 
lent strings generated by a table of random 
numbers. It is found that, although S knows 
nothing of the rules of formation, the redundant 
strings are more easily memorized. Since the 
redundant strings show greater intralist simi- 
larity, it is concluded that Ss group and recode 
the redundant strings, thus avoiding the inter- 
ference effects that would be expected for the 
uncoded strings. 

In agreement with earlier studies, it is found 
that the amount of material learned increases 
when the strings are redundant, but the amount 
of information, measured in bits, decreases. In 
terms of the information measure, therefore, 
redundancy in the materials to be learned does 
not increase the efficiency of learning. The 
several parameters involved in these relations 
are briefly reviewed. 

The results also indicate that, for the particular
redundancy used, some Ss find the redundant
strings more difficult to learn if they have first
learned a series of random strings.

In specifying the complexity level
of a particular motor skill task, one 
finds that a major factor is the 
number of components or subtasks 
which define the whole task. Thus, 
as the number of components of a 
whole task is increased, S may find 
acquisition of the whole skill to be 
increasingly difficult. An obvious ex- 
pediency here involves initial training 
on parts or single dimensions with 
subsequent practice on combinations 
of components as in the part-whole 
paradigm. Unfortunately, practice 
on individual components generally 
does not increase proficiency levels 
on the whole task to that level 
achieved by whole-task training itself 
(1, 3, 5, 8). However, these differ- 
ences in performance between whole- 
and part-practice schedules at the 
time of transfer can be reduced 
rather quickly by continued post- 
transfer training on the whole task. 
This can result in less training required
on the whole task itself following
part-task practice, a saving which
can be of considerable importance 
(1, 8). 

Except for one study (1), these generalizations are based on
experiments in which the components defining the whole task
were physically independent and thus could be manipulated
independently by Ss. For example, when S makes a correction
for an azimuth error with the Pedestal Sight Manipulation
Apparatus, it is not necessary to coordinate simultaneous
responses to the elevation or range dimensions of control, and
the evidence suggests that S will indeed respond to each of the
several components independently, on the average (2, 7).

1 This research was conducted in the Laboratory of Aviation
Psychology of the Ohio State University and was supported in
part by the ... AF 41(657)-70 with the OSU Research Foundation,
monitored by Robert L. French and Bryce ... for the AF
Personnel and Training Research ... publication, use, and
disposal in whole or in part ...

Most skill systems involve com- 
ponents which are not physically 
independent, one from another; inter- 
actions exist among the components 
of the whole system such that cor- 
rections applied to one component 
affect other components as well and 
therefore necessitate concomitant re- 
sponses by S to these other com- 
ponents. To illustrate, in the opera- 
tion of an aircraft, a lateral deflection 
of the control column has the primary 
effect of introducing roll with a 
subsequent change in heading. How- 
ever, this control action has the 
secondary effect of a reorientation 
of the lift vector with subsequent loss 
in altitude by the aircraft. Thus, 
the pitch and roll components of the 
system interact in such a way as to 
require S to pull back on his control 
column whenever he desired to displace
that control laterally.² By such
a coordinated response, S can thereby 
cancel the effects of the pitch-roll 
(P X R) interaction. 


² We assume no rudder corrections are required.

FIG. 1. The elements of the tracking display and the
schematized effects of component interaction on the display.
Degrees refer to vector angles.


The purpose of the present study 
was to examine the effects of such 
a system interaction on the acquisi- 
tion and transfer of skill. The major 
variable under study was the degree 
of P X R interaction present in the 
skill task (the amount of loss of lift 
resulting from lateral deflections of a 
control device in an aircraft simula- 
tor). Three hypotheses were tested 
concerning the effects of degree of 
component interaction on transfer 
performance levels: (a) the benefits 
of part-task practice, as measured 
on the whole task, will decrease as 
the degree of component interaction 
present in the whole task is increased;
(b) proficiency levels during transfer 
will increase with the degree of 
similarity of component interaction 
in the training and the transfer sys- 
tems; and (c) transfer from a higher 
to a lower degree of component inter- 
action will result in a greater level 
of transfer performance than will 
transfer from a lower to a higher 
degree of interaction. 


The first hypothesis merely acknowledges
that training on individual
components cannot provide S with 
practice on a coordination of his 
responses such as will be required in 
operation of the whole task when that 
whole task involves an interaction 
of components. The second and 
third hypotheses indicate the expecta- 
tion of an asymmetrical generalization 
gradient, i.e., transfer from “difficult” 
to “easy” versions of the task is 
expected to involve a greater degree 
of transfer than that found with 
transfer from “easy” to “difficult” 
(9, 12). Thus, this experiment has 
two parts: Part I represents a test 
of the first hypothesis, while Part 
II tests the second and third. Data 
collection for both parts was accom- 
plished concurrently. 


METHOD 
Apparatus 


A version of the OSU Pilot Training Research 
Simulator (4) served as the skill task. The 
essential features of this device are illustrated in 
Fig. 1 and 2. The S was provided with a 5-in. 
cathode ray tube (CRT) display and a spring- 
centered, two-dimensional control column. He 
attempted to manipulate the control so as to 
hold a cursor at the center of a 1-in. circular
reticle. Only the cursor, identified as an own-position
indicator, moved about the display;
the reticle was fixed at the center of the CRT
and thus S was provided with a compensatory
display of tracking information.

[Figure 2. Block diagram; legible labels: Scoring, X, Exponential Oscillation, Integration.]

FIG. 2. The control dynamics of the X and Y (roll and pitch)
dimensions of the tracking task.

Right or left deflections of the control column
generated a voltage X which was subjected to 
two stages of analog integration. These trans- 
formations provide for a simulation of azimuth 
corrections in an aircraft, and the output affected 
lateral movements of the own-position indicator 
on the CRT. Fore or aft control deflections 
generated a voltage Y which was transformed 
by an exponential oscillatory function plus one 
stage of analog integration, a combination which 
simulates the underdamped response of an 
aircraft to pitch corrections. The output of 
the first integral term in the X dimension repre- 
sents bank angle, and this signal was fed to a 
sine-cosine resolver, the output of which deter- 
mined the angle of incidence of the own-position 
indicator. Rotation of this indicator repre- 
sented the lateral attitude of the “wings” of S’s 
aircraft. 

The desired physical interaction was achieved 
by coupling the X to the Y dimensions of the 
control system as shown in Fig. 2; the voltage 
signal representing bank angle was subjected 
to attenuation, rectified, and summed with the 
Y signal at the integral transformation in the 
pitch control portion of the system. This 
rectified signal always appeared as a negative 
quantity at the summing point, and thus S
was required to supply a positive signal of like 
magnitude in order to keep the cursor from 
moving off the lower portion of the CRT. This 
was achieved by deflecting the control column 
back from its spring-centered position. The 
attenuation enabled E to vary the amount of
component interaction from zero to a maximum, 
at which level the cursor followed a vector path 
whose angle was 76° from horizontal. At zero 
interaction, lateral deflections of the control 
column affected only the X or roll dimension of 
the system, and thus S could respond to both 
dimensions independently. The presence of any 
degree of interaction other than zero resulted
in cursor movement along vector paths, as 
illustrated in Fig. 1, and S could no longer 
correct the pitch and roll dimensions inde- 
pendently. Six levels of component interaction 
were selected for study, as shown in Fig. 1. At 
(P × R)_3, for example, a lateral deflection of the
control column resulted in cursor movement at 
a downward angle of 23°, or 30% of the maxi- 
mum possible vector. 

The input signals indicated in Fig. 2 were 
produced by two electronic random signal 
generators. These signals were peaked at 
3 cpm with 95% of their power distributed below 
12 cpm. The amplitude distribution of each 
input was normal with an SD of 3.9 v. The 
diameter of the 1-in. circular reticle corresponded
to ±3 SD’s of the input signal. With
these inputs the own-position indicator was 
forced away from the center of the reticle 
in a random manner, and S was instructed to 
compensate for his error by appropriate control 
deflections. The X input was statistically 
equivalent to, but independent of and un- 
correlated with, the Y input. 


Subjects and Procedure 


A total of 160 Air Force ROTC students were 
paid for service in this experiment. None had 
had prior flight training. Each S served for 
four 45-min. sessions with 24 hr. between ses- 
sions. During the first session, all Ss received 
a total of eight 30-sec. tracking trials following 
detailed instructions and demonstrations by E.
There were a total of 16 30-sec. trials during 
each of the subsequent sessions, and practice 
was carried out in four-trial blocks. There was a
30-sec. rest between trials within blocks and a
4-min. rest between blocks. The Ss served in
pairs, alternately tracking and observing in 
four-trial blocks. 

Part I.—Four degrees of P × R interaction
were used in a test of the hypothesis that a
part-task training schedule will be progressively
less efficient as the component interaction
present in the whole task is increased: (P × R)_0,
(P × R)_1, (P × R)_5, and (P × R)_9. As indicated
in Table 1, Groups 1 through 4 served
as the experimental groups. There were totals 
of 12 30-sec. trials on pitch corrections alone 
and 12 trials on roll corrections alone for each 
S over Sessions 1 and 2. Transfer to the 
designated whole-task condition took place 
in Session 3. Groups 5, 6, 8, and 10 served as 
control groups against which to compare the 
performance levels of Groups 1-4, respectively, 
in Session 3. 

Part II.—As a test of the second and third hypotheses, six
groups began and continued training on the two-dimensional
tracking task where level of component interaction defined
groups. Table 1 indicates that Groups 5, 6, 7, 9, and 10 served
as experimental groups with each S receiving a total of 40
30-sec. trials of tracking under the interaction level of his
group prior to transfer to (P × R)_5 during Session 4. Thus,
Group 8 provided the control procedure against which to measure
levels of transfer. All groups received a total of 16 trials on
(P × R)_5 during the transfer session. As shown in Table 1,
Groups 6 and 7 transferred to a higher degree of component
interaction compared to that present in the training system,
while Groups 9 and 10 transferred to a lower level, and Group 5
transferred from a system involving no component interaction.
The choice of interaction level (P × R)_5 as the transfer
condition was arbitrary in the sense that it is not necessarily
typical of any particular skill system now in use.

TABLE 1

EXPERIMENTAL DESIGN FOR PARTS I AND II
(16 Ss per group)

                                  Sessions
Part    Group   1            2            3            4

I       1       P, R (a)     P, R (b)     (P × R)_0    (P × R)_0
I       2       P, R         P, R         (P × R)_1    (P × R)_1
I       3       P, R         P, R         (P × R)_5    (P × R)_5
I       4       P, R         P, R         (P × R)_9    (P × R)_9
I, II   5       (P × R)_0    (P × R)_0    (P × R)_0    (P × R)_5
I, II   6       (P × R)_1    (P × R)_1    (P × R)_1    (P × R)_5
II      7       (P × R)_3    (P × R)_3    (P × R)_3    (P × R)_5
I, II   8       (P × R)_5    (P × R)_5    (P × R)_5    (P × R)_5
II      9       (P × R)_7    (P × R)_7    (P × R)_7    (P × R)_5
I, II   10      (P × R)_9    (P × R)_9    (P × R)_9    (P × R)_5

(a) Four trials pitch corrections alone, four trials roll corrections alone.
(b) Eight trials pitch corrections alone, eight trials roll corrections alone.


Scoring 


On each trial E recorded three measures of tracking
performance: (a) two integrated error squared scores, one for
the X and one for the Y dimension, and (b) one integrated
cross-product of the X and Y errors. Although this was a
two-dimensional tracking task, it was possible to derive a
single, unitary score of tracking proficiency. This score,
identified as the standard error (SE_xy), was determined by
combining the above measures as follows:

$$SE_{xy} = \left[\int e_x^2 + \int e_y^2 - 2\int e_x e_y\right]^{1/2}$$

where e represents tracking error. The squaring, multiplying,
and analog integration operations were performed continuously
by electronic circuits during each 30-sec. trial. These SE_xy
scores were later summed and averaged for each S over each
four-trial block, a total of 2 min. of tracking.
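
A discrete-time analogue of this score may help in reading the Results. The sketch below is an assumption-laden illustration, not the original analog circuitry: it assumes the three recorded measures combine as the square root of (mean squared X error + mean squared Y error − 2 × mean cross-product), that the integrals are normalized by trial duration so the score is in volts, and that the errors are available as equally spaced samples. The function name `se_xy` is hypothetical.

```python
import math


def se_xy(err_x, err_y):
    """Combined tracking score from sampled X and Y error records (same units)."""
    if len(err_x) != len(err_y) or not err_x:
        raise ValueError("need two equal-length, non-empty error records")
    n = len(err_x)
    ms_x = sum(x * x for x in err_x) / n                      # mean squared X error
    ms_y = sum(y * y for y in err_y) / n                      # mean squared Y error
    cross = sum(x * y for x, y in zip(err_x, err_y)) / n      # mean cross-product
    # Guard against tiny negative values caused by floating-point rounding.
    return math.sqrt(max(0.0, ms_x + ms_y - 2.0 * cross))


# Two uncorrelated records, each with an RMS value of 3.9 v.
x_record = [3.9, -3.9, 3.9, -3.9]
y_record = [3.9, 3.9, -3.9, -3.9]
combined = se_xy(x_record, y_record)
```

Under these assumptions two uncorrelated 3.9-v. records combine to about 5.5 v., which matches the one-dimensional and two-dimensional input values reported in the Results.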


RESULTS 
Part I 


Figure 3 summarizes the data in test of the hypothesis
concerning the efficiency of part-task practice as a function
of the degree of whole-task component interaction. During the
part-task training trials, Blocks 1-6, the performance levels
of Groups 1-4 are presented separately for the X (roll) and Y
(pitch) components. The square roots of the integrated error
squared scores (RMS_x and RMS_y) serve as the metrics here. The
standard error (SE_xy) measure, discussed above, serves to
describe the acquisition functions of the control groups on
Sessions 1, 2, and 3 (Groups 5, 6, 8, and 10), and the same
measure describes the transfer performance levels of Groups 1-4
during Sessions 3 and 4. Each point in Fig. 3 is the average
performance level of the 16 Ss in each group over the four
trials in each block. The interaction levels of the X and Y
components in the whole or transfer tasks appear as the
parameter in Fig. 3.

FIG. 3. Training and transfer performance levels for the
control and experimental groups of Part I. Horizontal axis:
four-trial blocks.

Training blocks.—Several points are apparent from visual
inspection of the data defining tracking performance during
Blocks 1-6. First, during the part-task training trials, none
of the experimental groups show a significant change in
performance as a function of trials. The RMS_x and RMS_y
scores are uniformly low with little or no negative trend. It
follows, then, that skill in tracking 
with the roll component alone and 
with the pitch component alone was 
easily acquired early in the training 
trials. This is in contrast to the 
four control groups (Groups 5, 6, 8, 
and 10) over the same period where 
initial tracking proficiency was quite 
low, especially for Groups 8 and 10. 
Further, skill levels on the whole 
task, as shown by the control groups, 
never did approach closely those 
levels achieved by Groups 1-4 during 
the training period. This, in part, 
may be a function of the differences 
in power of the inputs involved. The 
RMS value of the input for one- 
dimensional tracking was 3.9 v., 
while the standard error measure 
of the inputs for two-dimensional 
tracking was 5.5 v. These estimates 


were based on 10-min. samples of the 
input signals. Thus, Groups 1-4 
did exceed the RMS level of the input 
(RMS_x and RMS_y on Block 6 averaged
3.3 v.), while the control groups
never did achieve SE_xy measures of
tracking performance lower than that 
for the input (5.8, 6.5, 6.3, and 6.0 v. 
for Groups 5, 6, 8, and 10, respectively, 
on Block 10). 

Transfer blocks.—A comparison of
the initial transfer performance levels 
of Groups 1-4, the Block 7 data of 
Fig. 3, with the Block 1 data of the 
appropriate control groups shows 
that some degree of positive transfer 
was achieved by each of the experi- 
mental groups. A convenient index 
to express this transfer is

$$\frac{C_1 - E_7}{C_1 - C_7} \times 100,$$

where $C_1$ is the performance of a
control group on Block 1 (13.1, 13.2,
16.2, and 16.3 v. for Groups 5, 6, 8,
and 10, respectively), $C_7$ is the
tracking proficiency of the same
groups on Block 7 (7.3, 7.1, 6.5, and
6.9 v.), and $E_7$ is the performance
level of the appropriate experimental
group at Block 7 (11.4, 10.8, 14.5,
and 14.6 v. for Groups 1-4, respectively).
These indices are 29%, 39%,
18%, and 18% positive transfer for
Groups 1-4, respectively.
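
The arithmetic behind these percentages can be checked directly from the Block 1 and Block 7 values quoted above. The following is a minimal Python sketch; the function name `transfer_index` and the dictionary layout are illustrative assumptions and do not come from the original report.

```python
def transfer_index(c_first, c_transfer, e_transfer):
    """Percentage transfer: (C_first - E_transfer) / (C_first - C_transfer) * 100."""
    return (c_first - e_transfer) / (c_first - c_transfer) * 100.0

# Error scores in volts, as quoted in the text. Experimental Groups 1-4 are
# paired with control Groups 5, 6, 8, and 10, respectively.
part1 = {
    1: {"c1": 13.1, "c7": 7.3, "e7": 11.4},
    2: {"c1": 13.2, "c7": 7.1, "e7": 10.8},
    3: {"c1": 16.2, "c7": 6.5, "e7": 14.5},
    4: {"c1": 16.3, "c7": 6.9, "e7": 14.6},
}

# Round to whole percentages, as the text reports them.
indices = {
    group: round(transfer_index(v["c1"], v["c7"], v["e7"]))
    for group, v in part1.items()
}
print(indices)
```

Rounded to whole percentages, the computed values agree with the four indices reported for Groups 1-4.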

These transfer indices substantiate, 
only in part, the hypothesis under 
test here. It was expected that the 
efficiency of pure part-task practice 
would be progressively lessened as the 
degree of component interaction present
in the whole task was increased.
The decrease in efficiency found here 
was not progressive in that the 
(P X R); level actually enjoyed a 
higher transfer index than did the 
(P X R)o condition. However, both 
the higher levels of component inter- 
action, Groups 3 and 4, did enjoy 
less benefit from part-task practice 
than did the lower degrees of interaction,
Groups 1 and 2.

The above observations were borne 
out by the results of analyses of 
variance which were performed on the 
data of Session 3, Blocks 7-10. The 
results of these analyses show that 
differences between Groups 1 and 5 
and between Groups 2 and 6 are 
statistically significant at P < .05,
while those for Groups 3 and 8 and
for Groups 4 and 10 are significant at
P < .001.

Fig. 4. Training and transfer performance levels for the control and experimental groups of Part II.


Part II 


Figure 4 summarizes the results of 
the test of the second and third 
hypotheses, concerning transfer per- 
formance levels as a function of 
similarity of component interaction 
in training and transfer systems. 
Again, the SE,, metric was employed, 
and each point in Fig. 4 is the average 
performance level of 16 Ss in each 
group over the four trials in each 
block. As indicated earlier, there 
were 10 blocks of training trials 
prior to transfer to the (P X R); 
level of component interaction. 
Transfer was effected on Block 11. 
For convenience, the data of Block 11 
have been replotted in the upper 
right of Fig. 4 where the training 
interaction level appears as the 
independent variable. 

Training blocks.—As shown earlier
by Fig. 3, there exist differences 
among the several interaction groups 
early in training (Blocks 1 and 2), 
but all groups appear to be approach- 
ing closely the same terminal levels 
of performance by Session 3 with 
considerable amounts of crossing of 
the acquisition functions during 
Session 2. 

Transfer blocks.—The transfer performance
levels of Groups 5 through
10, as shown by the Block 11 data of 
Fig. 4, in part confirm the hypotheses 
under test; the figure shows an 
asymmetrical generalization gradient, 
as predicted, but with that portion 
of the gradient covering the lower 
interaction levels being somewhat 
more regular than the gradient at 
levels higher than the control condi- 
tion, (P X R)s. Further, the average 
transfer performance level of Groups
9 and 10 is somewhat better than the 
average of Groups 6 and 7, although 
this difference is not statistically 
significant (P > .05). Thus, trans- 
fer from higher to lower levels of 
component interaction and vice versa 
was in the direction predicted, but 
the difference was not reliable. 

The same index as was used in 
Part I can serve to specify the 
amounts of transfer here. The value 
of $C_1$ is 16.2 v. and that of $C_{11}$ is 5.7 v.
(Group 8), while $E_{11}$ is 8.9, 7.6, 6.7,
7.1, and 5.8 v. for Groups 5, 6, 7, 9, 
and 10, respectively. These indices 
as calculated are 70%, 82%, 90%, 
87%, and 99% for Groups 5, 6, 7, 9, 
and 10. The trend of these per- 
centages is, of course, in line with the 
above observations concerning the 
asymmetry of the generalization 
gradient during the initial transfer 
trials. 
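
The Part II percentages follow from the same index. The Python sketch below recomputes them from the values quoted above and compares the mean transfer scores of Groups 9 and 10 with those of Groups 6 and 7. The constant names are illustrative assumptions, and the second control value is taken to be Group 8's score on the transfer block.

```python
C_START = 16.2     # Group 8 (control) on Block 1, in volts
C_TRANSFER = 5.7   # Group 8 (control) on the transfer block, in volts

# Transfer-block scores (volts) for the groups trained at other interaction levels.
E_TRANSFER = {5: 8.9, 6: 7.6, 7: 6.7, 9: 7.1, 10: 5.8}

def transfer_index(e_score):
    """Percentage transfer relative to the control group's improvement."""
    return (C_START - e_score) / (C_START - C_TRANSFER) * 100.0

part2 = {group: round(transfer_index(e)) for group, e in E_TRANSFER.items()}
print(part2)

# Lower error scores mean better tracking. Groups 9 and 10 trained at higher
# interaction levels than the transfer task; Groups 6 and 7 at lower levels.
mean_from_higher = (E_TRANSFER[9] + E_TRANSFER[10]) / 2
mean_from_lower = (E_TRANSFER[6] + E_TRANSFER[7]) / 2
print(mean_from_higher, mean_from_lower)
```

The rounded indices match those reported in the text, and the mean error for Groups 9 and 10 is lower than that for Groups 6 and 7, which is the direction of the asymmetry described above.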

An analysis of variance for re- 
peated measures was applied to the 
data of Blocks 11-14. A statistically 
significant (P < .01) interaction of 
Blocks X Groups was noted, and the
Duncan Multiple Range Test (6) 
was employed to determine the pat- 
tern of significance among the several 
means defining that interaction. Of 
particular importance are the differ- 
ences among means for Block 11. 
The results of the Duncan test show 
that (a) the four means found on 
Block 11 for Groups 5, 6, 7, and 8 
are all statistically different, one 
from another, at P < .05, (b) the 
mean for Group 9 is not significantly 
different from that found for either 
Group 6 or Group 7 (P > .05), and 
(c) Group 10 does not differ signifi- 
cantly from Group 8 (P > .05). 

Thus, whereas a statistically sig- 
nificant generalization gradient was 
noted for the data of Block 11, as 
predicted, the form of the gradient 
departs at one point from that
expected: the level of proficiency for
Group 10 was significantly higher 
than anticipated. Coupled with the 
finding that transfer from “difficult” 
versions of the tracking task was 
not significantly greater than that 
from “easy” versions, it is concluded 
that the second hypothesis was con- 
firmed only in part, while support for 
the third hypothesis is only suggested 
by the data. 


Discussion 


The above results indicate that, as a 
variable in complex skill acquisition and 
transfer, component interaction follows 
a fairly predictable pattern. Both the 
hypothesis concerning part-task practice 
(Part I) and the prediction of a general- 
ization gradient (Part II) were supported 
by the data, with but a single departure 
from expectations in each case. The 
data, therefore, reveal no major depar- 
tures from predictions, save the finding 
in Part II that transfer from higher to 
lower levels of component interaction 
was not significantly superior to transfer 
from lower to higher levels. A minor 
surprise comes from the rather low 
index of transfer found in Part I for 
Group 1: after a total of 24 trials of 
practice independently on the pitch 
and on the roll components, that group 
shows only 29% transfer to a whole 
task which involves no component 
interaction. As pointed out earlier, 
this version of the whole task does not 
require S to coordinate his responses 
to the two dimensions of the system, 
and as such it was expected that transfer 
from part to whole tasks would be high, 
at least in the order of 50% or more 
(3, 5). 

The relatively low index of transfer 
found here for Group 1 can be con- 
trasted with that reported by Briggs, 
Fitts, and Bahrick (4), where approxi- 
mately 61% positive transfer was 
achieved by a group (Group 2) which 
transferred after 30 training trials to a 
system identical to that encountered 
here by Group 1. The major difference
between the training tasks for these
two groups is that the present Group 1 
practiced the individual system com- 
ponents (X and Y) while the previous 
group practiced on a simplified but 
two-dimensional version of the common 
transfer task (that earlier training system
required S to track through one
rather than two stages of integration 
for bank maneuvers). It was shown 
in the earlier report that this reduction 
in number of integral lag terms reduced 
the complexity of both the perceptual 
and the motor responses required of S. 

A critical consideration, then, is that 
the number of dimensions in a whole 
task be preserved in a training system 
to assure a reasonably high level of 
transfer. It follows from this that, 
in determining the characteristics of a 
training task, simplification rather than 
fractionation of a whole task may be 
expected to provide greater amounts of 
transfer, especially when the whole task 
requires or permits simultaneous re- 
sponses to the several components or 
dimensions of the control system. Thus, 
it appears that reduction in an op- 
portunity to integrate component skills, 
as in part practice, is far more detri- 
mental to transfer performance than is 
simplification in either the perceptual or 
motor response requirements. 

For the present, the above generaliza- 
tions should be restricted to whole tasks 
in which simultaneity of responses is 
required or possible. The part-whole 
training data from tasks in which re- 
sponses to the parts must be carried 
out in a serial order appear to be in 
conflict (10, 11). However, Seymour 
(11) suggests that in such tasks either 
pure part- or progressive part-task 
practice schedules are more efficient 
than the whole-task schedule, especially 
when the perceptual requirements of the 
whole task are complex. 


SUMMARY 


Ten groups of 16 Ss each received a total 
of 56 30-sec. trials tracking through simulated 
aircraft control dynamics. The variable under 
consideration was the degree of interaction
between two of the major components of this
tracking system, the lateral and the vertical 
dimensions of flight control. 

Three hypotheses were tested: (a) pure 
part-task practice will be progressively less 
beneficial, measured by transfer performance 
levels, as the degree of component (part) 
interaction is increased in the transfer or whole 
task, (b) transfer performance level will be
proportional to the similarity of component
interaction in the training and transfer tasks, 
with (c) transfer from higher to lower degrees 
of component interaction being greater than 
that found when transfer is from lower to higher 
degrees of such interaction. 

The hypotheses were substantiated, in part, 
with the third hypothesis not receiving the 
expected support of statistical significance. 
The theoretical significance of relatively low 
indices of transfer found in the test of the first 
hypothesis was discussed. It was concluded 
that simplification rather than fractionation 
of a whole task provides for the more efficient 
training schedule when the whole requires or 
permits simultaneous responses to the several 
parts.


ACQUISITION OF A VERBAL MAZE AS A FUNCTION OF
METHOD OF CORRECTION AND NUMBER OF
ALTERNATE CHOICES PER UNIT¹

R. F. THOMPSON AND W. J. BROGDEN


University of Wisconsin 


Experiments by Brogden and 
Schmidt (1, 2) have shown that diffi- 
culty in acquisition of verbal mazes 
increases linearly as a function of the 
number of alternate choices per unit. 
These results hold for total time and 
total errors to the criterion, for mazes 
of 16 and 24 units, and for number of 
alternate choices per unit of 2, 4, 6, 
and 8 for the 16-unit maze and of 2, 
3, 4, 5, 6, 7, 8, 9, 10, 11, and 12 for the 
24-unit maze. The variable number 
of alternate choices per unit is repre- 
sentative of the dimension of amount 
of discovery as defined by Melton 
(4). The increase in amount of dis- 
covery and the decrease in number of 
like responses that go with increase 
in number of alternate choices per 
unit probably account for the linear 
increase in difficulty of acquisition. 

In view of the discovery component 
of number of alternate choices per 
unit, the acquisition of verbal mazes 
should be less difficult if discovery 
time is reduced by alteration of the 
correction procedure. This hypothe- 
sis was tested and confirmed by 
Thompson (6) by comparing the 
standard correction procedure with a
modified correction procedure that 
allowed S to make only a single error 
at a choice point, following which E 
gave him the correct response. 
Thompson used mazes of 12 units in 
length that involved 2, 3, 4, 5, or 6 
alternate choices per unit. The correct
response for each unit was presented
to S by a memory drum after
he had spoken the correct response
(correction procedure) or after a
single response (modified correction
procedure). This differed from the
procedure of Brogden and Schmidt
in which S spoke the alternate choices
for each choice point until the correct
response was made, when E spoke
the word “Correct.”

1 This research was supported in part by a grant from the Research Committee of the Graduate School from funds provided by the Wisconsin Alumni Research Foundation.

The present paper is a report of two 
experiments: one a methodological 
study that has led to modification of 
procedure, and the other designed to 
extend knowledge about the method of 
correction in acquisition of verbal 
mazes of greater length and with a 
greater number of alternate choices 
per unit than studied by Thompson 
(6). 


PROCEDURE 


Experiment I.—This experiment is compar- 
able to the experiment reported by Brogden and 
Schmidt (1) except that the mazes were con- 
structed from 2-digit numbers instead of words 
representing spatial locus and that the response 
was written instead of spoken. The mazes were 
24 units in length with 2, 4, 6, 8, 10, and 12 
alternate choices per unit. 

The S’s responses were written on a Telauto- 
graph apparatus, the receiving machine being 
viewed in another room by E. Communication 
between E and S was by an intercommunication 
system. The S was given the choices for his 
maze, and instructed in the correction procedure.
When E said “Begin,” S was to write
down the numbers used in any order until he 
wrote the correct one, whereupon E said “For- 
ward” and S moved to the next unit, and so on, 
until one perfect repetition was completed by S. 
A 30-sec. intertrial interval was maintained
throughout practice.

Experiment II.—Mazes of 24 units length
with 2, 4, 6, 8, 10, and 12 alternate choices per 
unit were learned to the criterion of one perfect 
trial by the standard correction procedure 
and by the modified correction procedure. 
There were 14 Ss in each of the 12 subgroups, 
each S serving in a single subgroup only. The Ss 
were volunteers from the classes in elementary 
psychology at the University of Wisconsin. 
The maze patterns were identical with those 
used in the above study employing Telauto- 
graph apparatus, and so identical with the 
Brogden and Schmidt (1) patterns, except that 
2-digit numbers were coded for words. 

The instructions and practice on the mazes 
followed the condition used by Brogden and 
Schmidt (1) except for use of the modified 
correction procedure and use of numbers instead 
of words. Thus, in the correction procedure, S 
was presented with the numbers to be used in 
his maze and told to respond verbally with the 
numbers until he gave the correct response, 
whereupon E said “Forward,” and so on until 
one perfect repetition of the maze was completed. 
In the modified correction procedure, S was told 
to respond only once, whereupon E gave the
correct response. In addition, no attempt was 
made in the present experiment to maintain the 
analogy with a spatial maze. The two Es 
each tested half the Ss in each experimental 
condition.² The E recorded each error and the
time of the trial on prepared data sheets. A 
30-sec. intertrial interval was maintained 
throughout practice. 


RESULTS


Experiment I.—Analyses of variance
were completed on the time and
total errors to criterion data. Both
measures were found to increase
significantly (5% level) with increas- 
ing number of choices per unit (for 
time, F equaled 6.89 with 5 and 54 df; 
for errors, F equaled 17.08 with 5 and 
54 df). Tests for deviations from 
linearity (5, pp. 317-320) for both 
measures were not significant. Best- 
fitting straight lines were obtained 
for each measure by the method of 
least squares. The comparable data
from the Brogden and Schmidt study
(1) for 2, 4, 6, 8, 10, and 12 choices
per unit was combined with the
present data for both time and total
errors measures and the pooled data
treated statistically as a single experiment,
with the two methods of
running S as one variable. Analyses
of variance were completed on both
pooled measures, and for both, methods
and mazes were significant at the
5% level, but the interactions of
methods and mazes were in neither
case significant. For both measures,
the mean scores obtained in the
present study are larger than those
of the Brogden and Schmidt study.
The significance of the difference
in slope of the best-fitting straight
lines for the Brogden and Schmidt
data and the present data was tested
by the procedure described by Wilks
(7, pp. 157-159) for both time and
total errors. In both cases the t
ratios were less than one, demonstrating
no significant difference in
the slopes of the straight lines obtained
in the two studies. Hence,
it may be concluded that the two
different methods of running Ss
yield functions of the same slope but
of different absolute magnitude.

2 The authors wish to express appreciation to Davis Chambliss who assisted in the conduct of the experiment by testing half of the Ss.

Experiment II.—Analyses of vari- 
ance were completed on the data 
for time, total errors, total first 
errors, and trials to criterion and the 
results are given in Table 1. Since 
Bartlett tests for homogeneity of 
variance of the data indicated sig- 
nificant heterogeneity of variance, the 
1% level of significance will be em- 
ployed in the place of the 5% level. 
The results of the analyses show 
increase for all measures with in- 
creasing number of choices per unit 
(mazes). The two procedures differ 
in terms of time and total errors 
(larger scores for the correction pro- 
cedure) but do not differ in terms of 
first errors and trials. The E is a 
significant variable for total errors
only. The interaction of mazes and
procedures is significant for time and
total errors only. For both time and
total errors, the procedure by Es
interaction is significant, and finally,
the triple interaction of mazes by
procedures by Es is significant for
total errors only.

TABLE 1
ANALYSES OF VARIANCE

                             Time               Total Errors        First Errors       Trials
Source             df       MS          F        MS         F        MS        F       MS     F
Mazes (M)           5   10,838,670   17.14*   1,128,100   31.51*   84,086    8.24*    282   3.17*
Procedures (P)      1   31,835,926   50.35*   7,716,429  215.53*   17,856    1.75     363   4.08
Experimenter (E)    1    3,329,018    5.26      603,241   16.85*   52,223    5.12     518   5.82
M X P               5    2,042,059    3.23*     623,841   17.42*    6,653     .65      39    .44
M X E               5      416,364     .66       54,071    1.51     2,636     .26      50    .56
P X E               1    7,406,677   11.71*     474,991   13.27*   19,891    1.95      82    .92
M X P X E           5    1,252,134    1.98      124,922    3.49*   15,097    1.48     178   2.00
Error             144      632,341               35,801            10,209              89

*P = .01
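
Each F in Table 1 is the mean square for a source divided by the error mean square for the same measure. The Python sketch below recomputes the F ratios for the time measure from the tabled mean squares and also checks the error degrees of freedom against the design (14 Ss in each of 12 subgroups, split evenly between two Es). The variable names are illustrative assumptions.

```python
# Mean squares for the time measure, as read from Table 1.
MS_ERROR_TIME = 632_341
TIME_MS = {
    "Mazes (M)": 10_838_670,
    "Procedures (P)": 31_835_926,
    "Experimenter (E)": 3_329_018,
    "M X P": 2_042_059,
    "M X E": 416_364,
    "P X E": 7_406_677,
    "M X P X E": 1_252_134,
}

# F ratio = mean square of the source / error mean square.
f_time = {source: round(ms / MS_ERROR_TIME, 2) for source, ms in TIME_MS.items()}
for source, f_value in f_time.items():
    print(f"{source:18s} F = {f_value:6.2f}")

# Error df: 6 mazes x 2 procedures x 2 Es = 24 cells with 7 Ss per cell.
cells = 6 * 2 * 2
ss_per_cell = 14 // 2
error_df = cells * (ss_per_cell - 1)
print("error df =", error_df)
```

The recomputed values agree with the F column for time, and the error degrees of freedom come to 144 as tabled.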

Fig. 1. [...] total error (below), and number of alternate choices per unit with correction and modified correction procedures as parameter.

An orthogonal polynomials analysis
was completed on all four measures of
acquisition (3). For time and total
errors for both the correction and 
modified correction procedures, the 
linear component, and only the linear 
component, is significant. The equa- 
tions for the best-fitting straight lines 
were computed and are presented in 
Fig. 1 with plots of the curves and 
the empirical data points. For the 
trials measure under the modified 
correction procedure, no components 
were significant; under the correction 
procedure, the only significant com- 
ponent was the cubic. No compo- 
nents were significant for the first 
errors measure under the modified 
correction procedure. For the first 
errors measure under the correction 
procedure, however, the linear, quad- 
ratic, and cubic components were 
statistically significant. Equations 
for the trials and first error measures 
under the correction procedure were 
computed and are presented in Fig. 2 
with plots of the curves and the 
empirical data points. 

The significant effect of E and the 
interaction of mazes, procedures, and 
E on the total errors measure (Table
1) were at first thought to be due to 
sampling differences, since the two 
Es ran Ss in different semesters, thus 
failing to fulfill completely the requirement
of randomization. In order
to check on sampling differences,
the Ss’ grades and ACE scores were 
obtained and analyses of variance 
were completed to compare the two 
groups of Ss on these measures. No 
significant differences between groups 
were found. Hence it seems reason- 
able to conclude that the differences 
are a function of E and are not due to
sampling error. Total errors to the 
criterion are plotted against number 
of alternate choices per unit in 
Fig. 3 with Es and procedures as the 
parameters. 

The curves for both Es are essen- 
tially the same with the modified 
correction procedure, but with the 
correction procedure the curves of
E, for both time and errors show
much greater values for the mazes
with more than six alternate choices
than do the curves for £.

Fig. 2. Relation between trials to criterion, correction procedure (above); and relation between first errors to criterion, correction procedure (below), and number of alternate choices. In the equation for each of the two curves, x = 0 is the midpoint (7) for the scale of the independent variable, number of alternate choices per maze unit. Each positive or negative unit of x is equal to two units of the independent variable.
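
The coding of the independent variable described in the caption to Fig. 2 can be written out explicitly. The Python sketch below maps the six maze conditions onto the coded scale x, with x = 0 at the midpoint (7) and one unit of x equal to two alternate choices. The variable names are illustrative assumptions, and the final line uses the standard linear orthogonal-polynomial coefficients for six equally spaced levels.

```python
# Number of alternate choices per unit in the six maze conditions.
choices = [2, 4, 6, 8, 10, 12]

# x = 0 at the midpoint of the scale; one unit of x spans two alternate choices.
midpoint = sum(choices) / len(choices)
coded_x = [(c - midpoint) / 2 for c in choices]
print(midpoint, coded_x)

# Doubling the coded values gives the usual integer linear coefficients for
# six equally spaced levels; they sum to zero, as required for a contrast.
linear_coefficients = [int(2 * x) for x in coded_x]
print(linear_coefficients, sum(linear_coefficients))
```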


Discussion 


The results of Exp. I demonstrate 
that use of numbers and written rather 
than spoken responses have no effect 
upon the form of the functions obtained 
in 24-unit verbal mazes over differing 
numbers of alternate choices per unit. 
Numbers were found to be simpler in 
terms of presentation to S than were 
the directional words used previously 
(1, 2), and were adopted for Exp. II 
and in subsequent work with verbal 
mazes. 

In Exp. II, the results of the analyses 
of variance and the orthogonal polyno- 
mials analyses for time and total errors 
are in complete agreement with those 
obtained by Thompson (6) for the 
12-unit directional word verbal mazes. 
Both measures increase linearly with
increasing numbers of alternate choices
per unit, but the slopes of the functions 
for the correction procedure are steeper 
than those for the modified correction 
procedure (see Table 1 and Fig. 1). 
For the measures trials and first errors, 
no effect of number of choices was 
obtained under the modified correction 
procedure. With the correction pro- 
cedure, the effect on trials has only 
significant cubic components whereas 
that for first errors has significant 
linear, quadratic and cubic functions 
(see Fig. 2). These results are similar 
to those obtained by Thompson (6) 
in that significant effects for trials and 
first errors were obtained only for the 
correction procedure, but different in 
that the functions were linear. The 
difference between 12 and 24 units of 
length for the mazes in the two experi- 
ments probably accounts for the dif- 
ference in results. The 24-unit maze 
represents a learning task of consider- 
ably greater difficulty. The question- 
able validity of trials in comparing 
mazes (see discussion by Brogden and 
Schmidt [1], and Thompson [6]) and 


the lack of any reasonable hypothesis
to account for the cyclical variation in
number of trials to acquire the maze
as a function of number of alternate
choices place the fitted function in an
indeterminate category. The fitted
function for the relationship of first 
errors and number of alternate choices 
with the correction procedure must be 
placed in the indeterminate category 
also because of the lack of any reasonable 
hypothesis. 

The significant effects of Es and the 
interactions of Es by Procedures and 
Es by Procedures by Mazes for total 
errors and of Es by Procedures for time 
(Table 1) are shown in Fig. 3 to be 
primarily a function of more errors and 
more time under the correction pro- 
cedure for Ss run by E; on mazes with 
six or more alternate choices than for 
Ss tested by £,. That interaction 
occurs between E and S in experiments 
on learning is self-evident, but in general 
E has not been considered to be a critical
variable in determining the experimental
results. In the present experiment, 
there is much more interaction between 
E and S with the correction procedure
than with the modified correction pro- 
cedure. The quickness of E’s response 
“Forward” when S makes the correct 
response, his tone of voice, and other 
personal characteristics appear to be 
representative of the kinds of difference 
in behavior between Es that might 
account for the difference in performance 
by Ss. Even though precise informa- 
tion is lacking on what differences be- 
tween Es are responsible for the differ- 
ences in experimental results, the effect 
is of considerable importance. Experi- 
menter effects may be responsible for 
differences in experimental results when 
the same experiment has been replicated 
in different laboratories by different Es. 
In any case, consideration must be 
given always to the possibility in many 
experiments on learning that the ob- 
tained results are in part a function of 
the personal characteristics of E. 


SUMMARY 


When the experimental procedure of Brogden 
and Schmidt (1, 2) was changed in Exp. I 
from a vocal to a written response and from 
words to 2-digit numbers, the relations between 
time and total errors to the criterion and num- 
ber of alternate choices per maze unit were 
comparable except for the absolute magnitude 
of the measures. 

In Exp. II Ss learned the mazes of Exp. I 
under two procedures—the standard correction 
procedure and a modified correction procedure. 
Two Es tested half the Ss in each of 12 condi- 
tions (number of alternate choices per unit of 
2, 4, 6, 8, 10, or 12, and correction and modified 
correction procedure). 

Significant differences in slope were found 
for the linear functions of time and of total 
errors and number of alternate choices for the 
correction and modified correction procedures 
comparable to those obtained by Thompson 
(6) for the 12-unit word maze. Statistically 
significant effects of Es were obtained for the 
functions of time and total errors primarily 
under the correction procedure. This latter 
result suggests that consideration must be 
given to the possibility in many experiments on 
learning that the results may in part be a function
of the personal characteristics of E.


MOTIVATION SHIFT IN A COMPLEX LEARNING TASK¹


DAVID BIRCH 


University of Michigan 


Theoretical analyses and experi- 
mental investigations growing out of 
the Hullian postulate relating habit 
and drive, R = f(E) = f(H × D),
have emphasized the importance of 
the initial response hierarchy in 
determining performance (15, 18). 
According to the theory, an increase 
in the strength of D should result in 
improved performance if the response 
hierarchy is such that the correct 
response tendency is dominant, but 
may result in no difference or even in 
poorer performance if incorrect com- 
peting responses are relatively equal 
in strength to the correct response 
(a so-called complex learning task). 
Experimental studies, which have 
produced data relevant to the theory, 
have employed two methods for
obtaining differences in D among
groups. In some studies (3, 6, 7, 11,
12, 16, 17, 18, 19, 20), Ss were selected 
for degrees of emotionality by the 
Taylor scale (21). Other experiments 
(1, 9, 13) have performed experimental
manipulations designed to produce
differential D levels in Ss while
combinations of selection and manipu- 
lation have also been used (5, 8, 10, 
14). In these studies, which range 
considerably in complexity, the as- 
sessment of the initial response hier- 
archy was made on a priori grounds, 
and traditional learning measures, 





such as trials and/or errors to cri- 
terion, were used to evaluate the 
theory. ? 

The present study is concerned 
with the effects of an experimentally 
introduced shift in motivation on 
performance in a complex task but 
utilizes an alternative method. In 
brief, the application of the method 
is as follows: Ss are allowed the
opportunity to learn $R_1$ to two sets of
single stimuli, $S_{A1}$ and $S_{B1}$, and $R_2$
to two other sets of single stimuli,
$S_{A2}$ and $S_{B2}$, and are tested on the
single sets, $S_{A1}$, $S_{A2}$, $S_{B1}$, and $S_{B2}$,
and also on the compound sets,
$S_{A1}S_{B1}$ and $S_{A2}S_{B2}$. At a given level
of training Ss are divided into two
groups, one of which receives instructions
designed to increase its
level of D over that of the other,
and tested again on the complete
sets of stimuli. Thus, the method
attempts an assessment of the re- 
sponse hierarchy of the two groups 
before and after a differential shift 
in motivation. A model for re- 
sponse tendency combination (4), 
constructed to treat the problem of 
the probability of correct response to 
a two-element compound stimulus as 
a function of the probability of cor- 
rect response to the two elements 
where two responses are available to 
S, is appropriate for the assessments. 


Application of the model allows the determination of the values of four parameters from the data. 
These parameters are Pr(Da>d),,Pr(Da< —d), Pr(Dp>d), and Pr(Dpa<—d) where Da and Ds 


are ‘the differences between the correct and incorrect E’s for Stimuli A and B, respectively. 


The 


parameter, d, is chosen so that the following conditional probabilities are satisfied : 


Pr(A,+|Da>d)=1, Pr(A,+|[Dax—d)=0, Pr(A,+|—d<D,<d)=.5, 


Pr(B,+|Ds>d)=1, Pr(B,+|Ds<—d)=0, and Pr(B,+/—d<Dg<d)=.5. 


1 This study was carried out under a project with the Michigan Bell Telephone Company. 


Daniel Solomon and Astrid Beigel served as Es. 
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For convenience, a single symbol, d, has been used in these expressions though there is no necessity 
that |—d| = d or that the d’s be equal for A and B. 
It follows that 


Pr(A,+) = (.5)[1+Pr(Da>d)—Pr(Da<—d)] [1] 
and 


Pr(B,+) = (.5)[1+Pr(Ds>d)—Pr(Da<—d)] [2] 


where \(\Pr(A,+)\) and \(\Pr(B,+)\) are the probabilities of correct response to the single stimulus sets, A and B. Four classes of compound stimuli, (A,+;B,+), (A,+;B,−), (A,−;B,+) and (A,−;B,−), in the expected proportions, \(\Pr(A,+)\cdot\Pr(B,+)\), \(\Pr(A,+)\cdot\Pr(B,-)\), \(\Pr(A,-)\cdot\Pr(B,+)\) and \(\Pr(A,-)\cdot\Pr(B,-)\), where \(\Pr(A,-) = 1 - \Pr(A,+)\) and \(\Pr(B,-) = 1 - \Pr(B,+)\), result from the construction of the compound stimuli. The probabilities of correct and incorrect response to these four classes of the compound stimuli are:

\[ \Pr(AB,+ \mid A,+;B,+)[\Pr(A,+)\cdot\Pr(B,+)] = (.5)[\Pr(A,+)\cdot\Pr(B,+) + \Pr(D_A > d)\cdot\Pr(B,+) + \Pr(D_B > d)\cdot\Pr(A,+) - \Pr(D_A > d)\cdot\Pr(D_B > d)] \tag{3} \]

\[ \Pr(AB,- \mid A,+;B,+)[\Pr(A,+)\cdot\Pr(B,+)] = \Pr(A,+)\cdot\Pr(B,+)[1 - \Pr(AB,+ \mid A,+;B,+)] \tag{4} \]

\[ \Pr(AB,+ \mid A,+;B,-)[\Pr(A,+)\cdot\Pr(B,-)] = (.5)[\Pr(A,+)\cdot\Pr(B,+) - \Pr(D_A > d)\cdot\Pr(B,+) - \Pr(D_B > d)\cdot\Pr(A,+) + \Pr(D_A > d)] \tag{5} \]

\[ \Pr(AB,- \mid A,+;B,-)[\Pr(A,+)\cdot\Pr(B,-)] = \Pr(A,+)\cdot\Pr(B,-)[1 - \Pr(AB,+ \mid A,+;B,-)] \tag{6} \]

\[ \Pr(AB,+ \mid A,-;B,+)[\Pr(A,-)\cdot\Pr(B,+)] = (.5)[\Pr(A,+)\cdot\Pr(B,+) - \Pr(D_A > d)\cdot\Pr(B,+) - \Pr(D_B > d)\cdot\Pr(A,+) + \Pr(D_B > d)] \tag{7} \]

\[ \Pr(AB,- \mid A,-;B,+)[\Pr(A,-)\cdot\Pr(B,+)] = \Pr(A,-)\cdot\Pr(B,+)[1 - \Pr(AB,+ \mid A,-;B,+)] \tag{8} \]

\[ \Pr(AB,+ \mid A,-;B,-)[\Pr(A,-)\cdot\Pr(B,-)] = (.5)[\Pr(A,+)\cdot\Pr(B,+) - \Pr(D_A > d)\cdot\Pr(B,+) - \Pr(D_B > d)\cdot\Pr(A,+) + \Pr(D_A > d)\cdot\Pr(D_B > d)] \tag{9} \]

and

\[ \Pr(AB,- \mid A,-;B,-)[\Pr(A,-)\cdot\Pr(B,-)] = \Pr(A,-)\cdot\Pr(B,-)[1 - \Pr(AB,+ \mid A,-;B,-)] \tag{10} \]

In addition, the total probability of correct response to the compound stimuli, arrived at by summing Equations 3, 5, 7, and 9, is

\[ \Pr(AB,+) = 2\Pr(A,+)\cdot\Pr(B,+) + \Pr(D_A > d)[(.5) - \Pr(B,+)] + \Pr(D_B > d)[(.5) - \Pr(A,+)] \tag{11} \]
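The bookkeeping in Equations 1-11 can be checked numerically. The following is a minimal sketch in Python, not part of the original report; the function names and the parameter values in the usage note are hypothetical and chosen only for illustration. It evaluates the eight class probabilities from the four parameters and confirms that the four "correct" classes sum to the total given by Equation 11.

```python
def single_correct(p_pos, p_neg):
    """Equations 1-2: Pr(X,+) = .5 * [1 + Pr(D_X > d) - Pr(D_X < -d)]."""
    return 0.5 * (1.0 + p_pos - p_neg)


def compound_classes(a_pos, a_neg, b_pos, b_neg):
    """Return the left-hand sides of Equations 3-10.

    Keys are (response to AB, response to A, response to B), each "+" or "-".
    a_pos = Pr(D_A > d), a_neg = Pr(D_A < -d), and likewise for B.
    """
    pa = single_correct(a_pos, a_neg)
    pb = single_correct(b_pos, b_neg)
    shared = pa * pb - a_pos * pb - b_pos * pa  # terms common to Eqs. 5, 7, 9
    correct = {
        ("+", "+"): 0.5 * (pa * pb + a_pos * pb + b_pos * pa - a_pos * b_pos),  # Eq. 3
        ("+", "-"): 0.5 * (shared + a_pos),          # Eq. 5
        ("-", "+"): 0.5 * (shared + b_pos),          # Eq. 7
        ("-", "-"): 0.5 * (shared + a_pos * b_pos),  # Eq. 9
    }
    weight = {
        ("+", "+"): pa * pb,
        ("+", "-"): pa * (1.0 - pb),
        ("-", "+"): (1.0 - pa) * pb,
        ("-", "-"): (1.0 - pa) * (1.0 - pb),
    }
    out = {}
    for key, p_correct in correct.items():
        out[("+",) + key] = p_correct
        out[("-",) + key] = weight[key] - p_correct  # Eqs. 4, 6, 8, 10
    return out


def compound_total(a_pos, a_neg, b_pos, b_neg):
    """Equation 11: total probability of a correct response to the compounds."""
    pa = single_correct(a_pos, a_neg)
    pb = single_correct(b_pos, b_neg)
    return 2.0 * pa * pb + a_pos * (0.5 - pb) + b_pos * (0.5 - pa)
```

With the illustrative values Pr(D_A > d) = .50, Pr(D_A < −d) = .10, Pr(D_B > d) = .40, and Pr(D_B < −d) = .20, the single-stimulus probabilities are .70 and .60, the four "correct" classes come to .40, .17, .12, and .02, and both their sum and Equation 11 give .71.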


The purpose of the present experiment is to investigate the effects of a shift in motivation on performance in the light of the model. Consider Stimulus A where \(\Pr(D_A > d)\) and \(\Pr(D_A < -d)\) have known values under the standard motivational conditions of the experiment. According to a multiplicative relationship between H and D, an increase in D should result in an increase in \(|D_A|\) and tend to increase both \(\Pr(D_A > d)\) and \(\Pr(D_A < -d)\). Conversely, a decrease in D would be expected to decrease both parameters. Corresponding effects should occur for Stimulus B.

An increase in D has the following implications for Equations 1-11: \(\Pr(A,+)\) and \(\Pr(B,+)\) involve compensating effects and should not change appreciably; \(\Pr(AB,+ \mid A,+;B,+)[\Pr(A,+)\cdot\Pr(B,+)]\) and \(\Pr(AB,- \mid A,-;B,-)[\Pr(A,-)\cdot\Pr(B,-)]\) should increase; \(\Pr(AB,- \mid A,+;B,+)[\Pr(A,+)\cdot\Pr(B,+)]\) and \(\Pr(AB,+ \mid A,-;B,-)[\Pr(A,-)\cdot\Pr(B,-)]\) should decrease while \(\Pr(AB,+ \mid A,+;B,-)[\Pr(A,+)\cdot\Pr(B,-)]\), \(\Pr(AB,- \mid A,+;B,-)[\Pr(A,+)\cdot\Pr(B,-)]\), \(\Pr(AB,+ \mid A,-;B,+)[\Pr(A,-)\cdot\Pr(B,+)]\), \(\Pr(AB,- \mid A,-;B,+)[\Pr(A,-)\cdot\Pr(B,+)]\), and \(\Pr(AB,+)\) are indeterminate because of a lack of knowledge of the relative magnitude of the effects. The experiment is designed to test the predicted shifts in the theoretical parameters, \(\Pr(D_A > d)\), \(\Pr(D_A < -d)\), \(\Pr(D_B > d)\), and \(\Pr(D_B < -d)\), and in the observed values of the four relevant categories of the compound stimuli.
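The direction of these predictions can be illustrated with a short numerical sketch. It is not taken from the original report: the parameter values are hypothetical, and the increase in D is represented, as an assumption, by adding the same small increment to Pr(D > d) and Pr(D < −d) for both stimuli, which is the simplest case in which the compensating effects cancel exactly.

```python
def single_correct(p_pos, p_neg):
    """Equations 1-2: probability of a correct response to a single stimulus."""
    return 0.5 * (1.0 + p_pos - p_neg)


def hypothesis_categories(a_pos, a_neg, b_pos, b_neg):
    """Single-stimulus probabilities and the four categories with determinate predictions."""
    pa = single_correct(a_pos, a_neg)
    pb = single_correct(b_pos, b_neg)
    eq3 = 0.5 * (pa * pb + a_pos * pb + b_pos * pa - a_pos * b_pos)
    eq9 = 0.5 * (pa * pb - a_pos * pb - b_pos * pa + a_pos * b_pos)
    return {
        "Pr(A,+)": pa,
        "Pr(B,+)": pb,
        "AB,+ | A,+;B,+": eq3,                                # Eq. 3, predicted to rise
        "AB,- | A,+;B,+": pa * pb - eq3,                      # Eq. 4, predicted to fall
        "AB,+ | A,-;B,-": eq9,                                # Eq. 9, predicted to fall
        "AB,- | A,-;B,-": (1.0 - pa) * (1.0 - pb) - eq9,      # Eq. 10, predicted to rise
    }


# Hypothetical values before and after an increase in D (increment of .05).
before = hypothesis_categories(0.50, 0.10, 0.40, 0.20)
after = hypothesis_categories(0.55, 0.15, 0.45, 0.25)

for name in before:
    print(f"{name:16s} {before[name]:.5f} -> {after[name]:.5f}")
```

Under this assumption Pr(A,+) and Pr(B,+) stay at .70 and .60, while the responses in the same direction as the elements increase and the responses opposite to the elements decrease, as the text states.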





METHOD

Subjects.—The Ss were 108 males obtained from the introductory psychology class and paid at the rate of $1.00 per hour for participating in the experiment. Twelve Ss were discarded for failure to follow instructions.

Apparatus.—In order to provide learning, two sets of stimuli, \(S_A\) = letter pairs (L) and \(S_B\) = number pairs (N), of 20 items each were presented individually in a random sequence on flash cards for 10 sec. with the correct response exposed during the first 5 sec. One-half of the letter pairs (\(S_{A1}\)) and one-half of the number pairs (\(S_{B1}\)) were to be associated with the response, JIX, and the other half of each set (\(S_{A2}\)) and (\(S_{B2}\)) were to be associated with the response, DAC. The 20 letter pairs were constructed by pairing the 10 letters, B, F, G, H, K, N, Q, S, Y, and Z, in such a fashion that each letter appeared in the first position twice, once to be associated with JIX and once with DAC, and in the second position twice, again once to be associated with each response. No two letters were ever paired with each other more than once. The 10 digits, 0 through 9, were treated in like manner.
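One way to satisfy these construction constraints is a cyclic scheme, sketched below in Python. This is only an illustration: the actual pairings are those of Table 1, and the step sizes of 1 and 3, like the function names, are assumptions introduced here rather than the author's procedure.

```python
from collections import Counter

LETTERS = ["B", "F", "G", "H", "K", "N", "Q", "S", "Y", "Z"]
DIGITS = [str(n) for n in range(10)]


def build_pairs(symbols, step_jix=1, step_dac=3):
    """Pair each symbol with two others by stepping around the list cyclically."""
    n = len(symbols)
    pairs = []
    for i, first in enumerate(symbols):
        pairs.append((first, symbols[(i + step_jix) % n], "JIX"))
        pairs.append((first, symbols[(i + step_dac) % n], "DAC"))
    return pairs


def satisfies_constraints(pairs, symbols):
    """Check the position, response, and no-repeat constraints stated in the text."""
    first_counts = Counter((first, resp) for first, _, resp in pairs)
    second_counts = Counter((second, resp) for _, second, resp in pairs)
    for symbol in symbols:
        for resp in ("JIX", "DAC"):
            # Each symbol appears once per response in each position.
            if first_counts[(symbol, resp)] != 1 or second_counts[(symbol, resp)] != 1:
                return False
    unordered = {frozenset((first, second)) for first, second, _ in pairs}
    no_self_pairs = all(first != second for first, second, _ in pairs)
    # No two symbols are paired with each other more than once.
    return no_self_pairs and len(unordered) == len(pairs)


letter_pairs = build_pairs(LETTERS)
digit_pairs = build_pairs(DIGITS)
print(satisfies_constraints(letter_pairs, LETTERS), satisfies_constraints(digit_pairs, DIGITS))
```

Steps of 1 and 3 work for a list of 10 symbols because the resulting unordered pairs never coincide; other step choices would need to be checked with `satisfies_constraints`.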

Three sets of compound stimuli (LN) were 
constructed for use in the test series by joining 
on one flash card a letter pair and a number 
pair, chosen at random without replacement 
under the restriction that the same response be 
correct for both the letter pair and the number 
pair. Table 1 shows the complete set of stimuli 
and the assignment to the two response 
categories. 

Procedure.—Twelve groups, composed of approximately nine Ss each, participated as units in the experiment. The Ss were introduced to the apparatus and instructed that they were to learn the correct response, JIX or DAC, to each of the 40 stimuli. Learning series were alternated with test series, and Ss recorded their responses on prepared sheets of paper only during the test series when the stimuli alone were presented. Two learning and three test series were given. In the first learning series, each of the 40 stimuli along with the correct response was shown once while the second learning series allowed two trials on each of the 40 S-R combinations.

Each test series was made up of the 40 single 
stimulus pairs and 20 compound stimuli as 
indicated in Table 1. After the second test all 
Ss received a third test without an intervening 
learning series. Between the second and 
third tests Ss in the Control group (N = 44) 
were told only that they would receive another 
test immediately whereas Ss in the Increased 
Motivation (IM) group (N = 52) were in- 
structed that they were to be paired at random 
with each other and that an extra dollar would 
be awarded to that member of each pair who 
scored higher on the next test, to be given 
immediately. These instructions for the IM 
group were adapted from those used by 
Atkinson (2). 


RESULTS 


The differences in performance for the IM and Control groups on Tests 2 and 3 relate directly to the hypothesis under test. In order to gain more information, both about the model and about the effect of the motivation shift, each group was divided at its median number of compound stimuli correct on Test 2 to form a High and a Low subgroup for the two conditions. Equal-sized subgroups were obtained by assigning at random two Ss in the IM group and one S in the Control group with scores of 14 (the median scores for the two groups) to the Low subgroups. Table 2 contains the observed proportion of correct and incorrect responses by the two levels of the IM and Control groups on the Letters, Numbers, Compound Total, and the four classes of the compound stimuli for Tests 2 and 3. The bases for the proportions are 520 (26 Ss × 20 R's) for the IM subgroups and 440 (22 Ss × 20 R's) for the Control subgroups.

TABLE 1

Assignment of the Letter Pairs and Number Pairs to the Two Response Categories, JIX and DAC, and the Compound Stimuli Used in the Three Tests

DAVID BIRCH

TABLE 2

Observed Proportion of Correct and Incorrect Responses by the Two Levels of the Increased Motivation and Control Groups to the Letters, Numbers, and Compound Stimuli for Tests 2 and 3

The patterns of change from Test 2 
to Test 3 over the four subgroups for 
the Letters, Numbers, and Compound 
Total are irregular. More important 
to the test of the theory, however, 
are the changes in the responses for 
the four relevant categories of the 
compound stimuli. Table 3 presents 
the Test 3 minus Test 2 differ- 
ences for the two levels of the IM 


and Control groups for these cate- 
gories classified in terms of the 
(L,+;N,+) and (L,—;N,—) stimuli 
and the response direction to LN 
same as, or opposite to, that of the 
elements. According to the theory it is anticipated that the differences for the IM group should be positive for \(\Pr(LN,+ \mid L,+;N,+)[\Pr(L,+)\cdot\Pr(N,+)]\) and \(\Pr(LN,- \mid L,-;N,-)[\Pr(L,-)\cdot\Pr(N,-)]\), and negative for \(\Pr(LN,- \mid L,+;N,+)[\Pr(L,+)\cdot\Pr(N,+)]\) and \(\Pr(LN,+ \mid L,-;N,-)[\Pr(L,-)\cdot\Pr(N,-)]\). That this general pattern appears to hold for the IM group and a somewhat different pattern for the Control group may be seen in Table 3.

Table 4 summarizes the analysis of variance for the four-dimensional factorial design of Table 3. Four Test 3 minus Test 2 difference scores, one for each category of concern, were determined for each of the 96 Ss, and the differences for categories \(\Pr(LN,- \mid L,+;N,+)[\Pr(L,+)\cdot\Pr(N,+)]\) and \(\Pr(LN,+ \mid L,-;N,-)[\Pr(L,-)\cdot\Pr(N,-)]\) were multiplied by −1 so that the positive scores would indicate the direction of the hypothesis.

The main effect for Motivation Conditions fails to reach the 5% level of significance (F = 3.15, 1 and 92 df, P < .10), but the MC × L interaction is significant at the 1% level.


TABLE 3

Observed Test 3 Minus Test 2 Proportion Differences for the Two Levels of the Increased Motivation and Control Groups on the Four Selected Categories of Table 2, Classified in Terms of Stimuli, (L,+;N,+) and (L,−;N,−), and Responses to LN, Same or Opposite to the Elements of the Stimuli

Motiv. | Stimuli | Increased Motivation: Response to LN (Same, Opposite) | Control: Response to LN (Same, Opposite)


it +;N N+) 058 —.068} .007 
— 004) —.005 


(L,—;N,—)].011 Ea 031
I(L,+;N,+)|.008 |—
(L,—;N,—)|.014 | .006 |

TABLE 4

Analysis of Variance on the Test 3 Minus Test 2 Differences for the Four Selected Categories of the Compound Stimuli

Source
Between Ss
  Motivation Cond. (MC)
  Levels (L)
  MC × L
  Error (b)
Within Ss
  Stimuli (S)
  Responses (R)
  …

Note.—Mean squares for error in parentheses.
*P = …
**P = …


Because of the significant MC × L interaction, separate tests between the two motivating conditions were carried out at the High and Low levels. The mean Test 3 minus Test 2 difference of .102 for the IM-High subgroup differs significantly from the mean difference of −.075 for the Control-High subgroup at the 5% level (F = 10.51, 1 and 92 df) while the mean differences of .044 and .084 for the IM-Low and Control-Low subgroups, respectively, yield an F less than 1. Of the Within Ss components of variance only the L × S and MC × L × S × R interactions reach significance at the 5% level.

The values of the theoretical parameters, \(\Pr(D_L > d)\), \(\Pr(D_L < -d)\), \(\Pr(D_N > d)\), and \(\Pr(D_N < -d)\), obtained from Equations 3-10 in conjunction with the data of Table 2, are shown in Table 5. In solving for the values of Table 5, all eight equations were used in contrast to the method previously employed (4). The values of Table 5 were substituted in Equations 3-10 to arrive at the predicted values of Table 6. The adequacy of the fit is indicated by the difference in observed and predicted scores and the root mean squares of the differences presented at the bottom of Table 6.

TABLE 5

Derived Values of \(\Pr(D_L > d)\), \(\Pr(D_L < -d)\), \(\Pr(D_N > d)\) and \(\Pr(D_N < -d)\) for the Two Levels of the Increased Motivation and Control Groups on Tests 2 and 3, Classified in Terms of Stimuli and Direction of Difference

TABLE 6

Predicted Proportion of Correct and Incorrect Responses by the Two Levels of the Increased Motivation and Control Groups to the Compound Stimuli

With some confidence in the fit of the model to the data, an examination of the shifts in the theoretical values of Table 5 is in order. For the IM-High subgroup all four values increase from Test 2 to Test 3. However, with the exception of the shift in \(\Pr(D_L < -d)\) for the Control-High subgroup all other values of Table 5 decrease from Test 2 to Test 3. An analysis of variance, summarized in Table 7, was carried out on the factorial combination of the Motivation Conditions, Test 2 Performance Levels, Stimuli (Letters and Numbers), and Response Direction (\(D > d\) and \(D < -d\)) dichotomies to evaluate the Test 2 to Test 3 shifts. This analysis indicates no significant differences attributable to the Stimuli or Response Direction factors, either as main effects or in interactions. Accordingly, the Within Subgroups variation was pooled to provide the error term for testing the Between Subgroups components, each of which is significant beyond the 5% level. The over-all mean difference in shift for the IM group is .019 and for the Control group, −.048. Separate tests between the two motivating conditions show that the IM-High mean shift of .094 differs from the Control-High mean shift of −.036 at the .5% level of significance (F = 15.95, 1 and 12 df), whereas for the Lows, the IM and Control mean shifts of −.056 and −.058, respectively, yield an F less than 1. This pattern of significance of differences between the motivating conditions for the two levels of performance on Test 2 is the same as was found in the analysis of the Table 3 data.

TABLE 7

Summary of the Analysis of Variance on the Test 3 Minus Test 2 Differences for Derived Values of \(\Pr(D_L > d)\), \(\Pr(D_L < -d)\), \(\Pr(D_N > d)\) and \(\Pr(D_N < -d)\)


DISCUSSION


The present study, designed to ex- 
amine the effects of a shift in moti- 
vation on a complex task, points up 
the intricacies of such investigations. 
By definition, a complex task is one in 
which more than one response tendency 
may be operative in producing the 
response and, as such, calls for an 
analysis of the response hierarchy if 
successful predictions are to be made. 
Such an analysis was made in the present 
study by a model for response tendency 
combination, and the fit of the model 
to the data appears quite satisfactory. 

The model, which, in conjunction with the Hullian postulate of D × H, forms the basis for the analysis of the data, suggests that the direction of the gross shifts of \(\Pr(L,+)\), \(\Pr(N,+)\) and \(\Pr(LN,+)\) as a function of change in D is indeterminate because of compensating effects among the components. That no uniform direction of shift occurred for these values in the present data is shown clearly in Table 2.

The model does indicate, however, that consistent, differential shifts for the IM and Control groups from Test 2 to Test 3 should be found in two aspects of the data. Changes in D, presumed to be effected by the differential instructions to the two groups, should be manifested by differential shifts for the groups in the theoretical values, \(\Pr(D_L > d)\), \(\Pr(D_L < -d)\), \(\Pr(D_N > d)\), and \(\Pr(D_N < -d)\), and in certain categories of the compound stimuli. Analysis of the data tends to bear out the theory in both instances.

An over-all increase in the theoretical values for the IM group as a function of increased D is as expected from the theory while the over-all decrease in these values for the Control group suggests that the instructions to this group may have been D decreasing. The data from the compound stimuli relevant to the hypothesis, \(\Pr(LN,+ \mid L,+;N,+)[\Pr(L,+)\cdot\Pr(N,+)]\), \(\Pr(LN,- \mid L,+;N,+)[\Pr(L,+)\cdot\Pr(N,+)]\), \(\Pr(LN,+ \mid L,-;N,-)[\Pr(L,-)\cdot\Pr(N,-)]\) and \(\Pr(LN,- \mid L,-;N,-)[\Pr(L,-)\cdot\Pr(N,-)]\), show a pattern of shift for the IM group predicted by an increased D while the Control group tends to show no shift. Analyses on both sets of data, however, show an interaction effect between the motivating conditions and the level of performance on Test 2. It is clear that the theory is supported in both cases by the results from the IM-High and Control-High subgroups. The results for the IM-Low and Control-Low subgroups are not equally clear since there appears to be no difference between these two subgroups in either analysis. This result would not be at variance with the theory, however, if it could be shown either that the motivating instructions affect the High and Low subgroups differentially or that \(D_L\) and \(D_N\) for the Low subgroups are not sufficiently large to be affected significantly by the change in D employed.


SUMMARY 


An hypothesis, relating to the effects of a 
shift in motivation on the performance of a 
complex learning task, was derived from the 
Hullian postulate of D X H in conjunction with 
a model for response tendency combination. 
A total of 96 men were divided into two groups 
and allowed equal opportunity to learn to 
categorize letter pairs and number pairs into 
one of two response classes. Both groups were 
tested on the letter and number pairs and on 
letter-number compound stimuli before and 
after differential motivating instructions. Anal- 
yses of the shifts in performance for the two 
groups on two aspects of the data—the theoreti- 
cal parameters of the model and the relevant 
categories of the compound stimuli—support the 
use of the model in gaining information about the response hierarchy and also the general hypothesis of a direct relationship between D and specific response tendencies. The results emphasize the importance of a knowledge of the response hierarchy in experiments dealing with complex learning tasks since gross measures of performance may cover up or disguise systematic variation in the components of these measures.

THE ACQUISITION OF IMITATIVE AND OPPOSITION
RESPONSES UNDER TWO CONDITIONS OF
INSTRUCTION-INDUCED SET¹

VERA T. KANAREFF AND JOHN T. LANZETTA
Fels Group Dynamics Center, University of Delaware


In several recent studies of imita- 
tive behavior (6, 7) a lower proportion 
of imitative responses during acquisi- 
tion was obtained for adult Ss than 
in the early Miller and Dollard 
studies of children and rats (5). To 
account for this poorer performance, 
Schein (6) proposed that adults may 
associate guilt or anxiety with imita- 
tive behavior in a “test” situation. 
A series of studies by Luchins (4) 
indirectly supports Schein’s specula- 
tion by demonstrating that acquisi- 
tion of imitation in older children 
may also be affected by guilt or 
expectations of punishment. In 
Luchins’ studies, even when a con- 
federate gave obviously true answers, 
not all Ss consistently agreed with 
these answers. Confirmation of the
partner’s responses and clarification 
of the evidence were attempted in 
order to obtain 100% agreement, 
but consistent agreement resulted 
only when Ss were given some means 
for direct measurement. Subsequent 
interviews indicated that those Ss 
who did not imitate consistently 
thought they might “get into trouble” 
or be considered “cheats” if they did 
imitate. 

1 The writers wish to express their appreciation to Stuart Cook for his discussion concerning the variables, to Jo Anne Davis for her assistance in collecting and analyzing the data, and to Seymour Berger for his assistance in conducting pilot research on this problem.

The above results suggest at least
two variables that may affect the
rate and level of acquisition of an
imitative response in a choice situation:
the instrumental values of the
imitative response and the other 
response alternatives and the antici- 
pated social consequences (e.g., social 
approval or disapproval) associated 
with these behavior alternatives. Con- 
flict will presumably arise in those 
situations in which a given behavior 
pattern such as imitation is instru- 
mental to goal achievement but 
where negative sanctions for that 
behavior pattern are anticipated. For 
example, either expectations that 
imitation is not legitimate or anxiety 
previously conditioned to imitation 
may be elicited in test-like situations, 
and yet in some of these test-like
situations imitation may be the only 
effective means for doing well on the 
task. Since expectations of disap- 
proval should increase as the number 
of imitative responses increases one 
might anticipate that a general method 
of resolution of such conflicts would 
be an oscillating behavior sequence. 
The total number of imitative re- 
sponses in these sequences would be 
less than would be obtained under 
neutral or positive sanctions but 
greater than would be obtained if 
imitation were no more instrumental 
to goal achievement than any other 
response alternative. 

In the present investigation, the 
acquisition of imitation and opposi- 
tion responses in adult Ss is examined 
under two instruction-induced sets 
differing in the extent to which guilt 
for imitative responses would be
elicited: one set of instructions was 
designed to induce negative sanctions
towards imitative behavior, the other
was designed to induce neutral or 
slightly positive sanctions. In addi- 
tion, the instrumental value of imita- 
tion was manipulated by varying the 
percentage of partner’s choices which 
were considered correct; i.e., the 
three probabilities of occurrence of 
reinforcement for imitation were .80, 
.50, and .20. 

The primary hypothesis under in- 
vestigation may be stated as follows: 
As the instrumental value of imita- 
tion increases, the rate and level of 
acquisition of an imitative response 
will increase; the extent of this 
increase, however, will be a decreasing 
function of the strength of the pre- 
vailing negative sanctions towards 
imitation. Thus, the extent of in- 
crease in the rate and level of acquisi- 
tion of imitation as the probability 
of reinforcement for imitation in- 
creases from .20 to .80 should be 
greater under the instructions which 
induce neutral or slightly positive 
sanctions towards imitation than 
under the instructions which induce 
negative sanctions towards imitation. 

In addition to a test of this major 
hypothesis several subsidiary hypoth- 
eses concerning the resolution of the 
induced conflict will be explored. 
Under conditions whereby imitation 
is negatively sanctioned but is instru- 
mental to goal achievement, a con- 
flict will be engendered; Ss must 
choose between conforming to social 
norms and performing poorly on the 
task or employing a nonsanctioned 
behavior pattern and performing well. 
Those Ss who imitate may reduce the 
conflict or the anxiety associated 
with it by denying the existence of 
the negative sanctions, underestimat- 
ing the extent of imitation, or under- 
estimating their own success; whereas 
those Ss who do not imitate may reduce
the conflict or anxiety by lowering
their level of aspiration, overestimating
their own success, underestimating
their partner’s success,
denying the importance of the im- 
mediate task, or any combination of 
the above. Questionnaire data will 
provide exploratory information for 
the consideration of these subsidiary 
hypotheses. 


METHOD

General Procedure


The Ss, in pairs, performed a pitch discrimina- 
tion task ostensibly designed to improve existing 
methods for the identification of underwater 
craft. No communication between Ss was 
possible during the actual test period, for they 
were seated in separate, partially enclosed 
booths. A small metal response box containing 
two response keys and a pair of signal lights 
was located in each booth. The S indicated 
his judgment on each pitch discrimination by 
pressing the appropriate key on the response 
box. The two Ss were told they would give 
their judgments in a fixed order and that their 
partner’s judgment would be indicated by the 
colored lights above his response keys. Since both Ss were instructed that their partner would give his judgment first and since the response box lights in both of the booths were connected to a central control console, E could feed in a programmed sequence of “partner’s” judgments to each S. In post-session debriefings, Ss revealed no suspicions that the E-controlled responses of the partner were not those of the actual partner. The correctness of S’s choice was indicated by the presence or absence of a white light located on the front panel of his booth; this was controlled by E from the central console. In 60 pairs of tones (the critical trials) both the standard and the comparison tone were the same; there were 15 pairs of tones each at 800, 900, 1000, and 1100 cps. In 15 other pairs of tones the second tone
differed in pitch from the standard tone; the 
standard tone was either 800, 900, 1000, or 1100 
cps while the second tone was either above or 
below the standard. The pairs of different 
tones were randomly interspersed among the 
critical trials to prevent Ss from suspecting 
that both tones in a pair might be the same. A 
random sequence of the entire set of 75 pairs 
of tones was recorded on tape. 
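The composition of the 75-trial tape can be sketched as follows. This is an illustrative Python reconstruction only: the size of the pitch difference on the 15 non-critical trials and the way standards were distributed among them are not given in the text, so the 50-cps offset, the random choice of standard, and the function name are assumptions.

```python
import random

STANDARDS_CPS = (800, 900, 1000, 1100)


def build_tape(seed=0, offset_cps=50):
    """Return 75 (standard, second tone, is_critical) trials in random order."""
    rng = random.Random(seed)
    # 60 critical trials: identical tones, 15 pairs at each standard frequency.
    trials = [(freq, freq, True) for freq in STANDARDS_CPS for _ in range(15)]
    # 15 filler trials: second tone above or below the standard (offset is hypothetical).
    for _ in range(15):
        standard = rng.choice(STANDARDS_CPS)
        second = standard + rng.choice((-offset_cps, offset_cps))
        trials.append((standard, second, False))
    rng.shuffle(trials)  # intersperse the filler trials among the critical trials
    return trials


tape = build_tape()
print(len(tape), sum(1 for _, _, critical in tape if critical))
```

A fixed seed is used so that the same sequence can be regenerated, which mirrors the use of a single recorded tape for all Ss.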

The experimental procedure is readily ap- 
parent in that portion of the instructions which 
follows the introductory setting: “In this session 
two different tones will be presented, one at a
time, over the loudspeaker. Your task is to
judge whether the second tone is higher or lower 
than the first one. The two tones will always 
be different, that is, the second tone is always 
either higher or lower than the first one. The 
differences, however, may be very small or 
imbedded in a noisy background, for we are 
particularly interested in situations where
judgments are very difficult to make. It is in 
situations such as these that the number of 
judges will be critical. If there are times that 
you do not detect a difference, make a guess. 
The difference may be so subtle that you may 
not be aware that you noticed it. Although 
the discriminations are difficult, most of our 
subjects readily learn to make finer judgments 
with practice and get at least 80% of the judg- 
ments correct. Some people, however, can 
learn much more rapidly than others.” 

At this point Ss were sent to separate, partially enclosed booths to read additional instructions. The instructions in the booths were identical for both Ss; i.e., each S was told to give his judgments after his partner gave his, as follows (in this manner, E could control the sequence of “partner’s” responses and run two Ss simultaneously):

“The person sitting in this booth is the second person in your group—Subject B. Briefly, this is the procedure you are to follow: (a) First you will hear three low-pitched tones in rapid succession. This will be warning signal 1. (b) A few seconds after warning signal 1, the two tones to be judged will be presented, one after the other. (c) Your partner, Subject A, will make his judgment immediately. His judgment must be made before warning signal 2 is presented. Warning signal 2 consists of three high-pitched tones in rapid succession. (d) You will have ten seconds after warning signal 2 to make your judgment. (Please do not make your judgment before warning signal 2. The experimenter will be unable to record your judgment unless it is made after warning signal 2.) (e) Ten seconds after warning signal 2 you will again hear warning signal 1 to alert you for the next judgment.

“Use the small metal box located in your 
booth to indicate your judgment. Press the 
button at the bottom left (marked ‘higher’) 
if you judge the second tone to be higher than 
the first; press the button on the bottom right 
(marked ‘lower’) if you judge the second tone 
to be lower than the first. 

“The row of lights above these buttons 
represents your partner’s choices. When the 
green light (located above the ‘higher’ response 
button) goes on, it means that your partner, 
Subject A, has judged the second tone to be 
higher than the first. When the red light (located
above the ‘lower’ response button) goes
on, it means that he has judged the second tone 
to be lower than the first. 

“After you have made your judgment, 
watch the light on the panel wall in front of you.
If your judgment was correct, a white light 
will flash on; if your judgment was incorrect, 
the white light will not appear.” 

The E then briefly summarized the task 
procedure, gave Ss five practice trials without 
reinforcement, and then proceeded with the test 
series. 


Experimental Conditions 


Percentage of “partner’s” judgments called correct (probability of reinforcement).—The sequence of judgments considered correct for the 60 pairs of identical tones and the corresponding three response sequences for the “partner” were randomized within blocks of 10 trials with the restriction that there be an equal number of higher and lower judgments. Where equal distribution of responses among the higher and lower judgments was not possible, the odd judgments were balanced over blocks of trials. The three “partner”-response sequences form the three probabilities of reinforcement, .80, .50, and .20; i.e., in one sequence, 80% of the “partner’s” judgments are correct; in another sequence, 50% of them are correct; in the third sequence, only 20% of them are correct. These probabilities of reinforcement are taken with respect to the 60 critical trials. Since the “partner’s” judgments were always correct on the remaining 15 trials, the probabilities of reinforcement for the entire sequence are .84, .60, and .36. Whether S was correct or not depended on whether he agreed with the “partner” or disagreed with him and whether the “partner’s” judgment happened to be correct or incorrect on that particular trial. If S agreed consistently with the “partner” throughout all the trials, he would have been correct 8 times in each block of 10 trials when 80% of the “partner’s” judgments were correct, 5 times per block of 10 trials when 50% of the “partner’s” judgments were correct, and 2 times per block of 10 trials when 20% of the “partner’s” judgments were correct. Accordingly, imitation, or consistent agreement with the “partner,” would be a behavior pattern instrumental to goal achievement in the .80 condition, and opposition, or consistent disagreement with the “partner,” would be the instrumental behavior pattern in the .20 condition.
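The arithmetic behind the whole-sequence probabilities and the per-block payoffs can be verified with a short Python sketch. The function names are hypothetical; the trial counts and probabilities are those given in the text.

```python
from fractions import Fraction

CRITICAL_TRIALS = 60   # pairs of identical tones
FILLER_TRIALS = 15     # pairs of different tones; "partner" always correct


def overall_reinforcement(p_critical):
    """Probability that the "partner" is called correct over all 75 trials."""
    p = Fraction(str(p_critical))
    correct = p * CRITICAL_TRIALS + FILLER_TRIALS
    return correct / (CRITICAL_TRIALS + FILLER_TRIALS)


def correct_per_block(p_critical, imitate=True, block=10):
    """Expected number correct per block of critical trials for a consistent strategy."""
    p = Fraction(str(p_critical))
    return (p if imitate else 1 - p) * block


for p in (0.80, 0.50, 0.20):
    print(p, float(overall_reinforcement(p)), correct_per_block(p), correct_per_block(p, imitate=False))
```

For the .80, .50, and .20 conditions this gives 63/75, 45/75, and 27/75 over the full tape, that is, .84, .60, and .36. It also shows that consistent opposition in the .20 condition is correct on 8 of every 10 critical trials, the same payoff as consistent imitation in the .80 condition.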

For the 15 pairs of different tones, the “part- 
ner” always judged correctly and S was correct 
if he agreed with his “partner.” 

Instruction-induced set—That portion of the 
introductory setting common to both sets of
instructions was as follows: “Here is an illustration
of the type of problem that we are studying.
Our national and civil defense agencies have 
long been faced with the problem of identifica- 
tion of hostile and friendly underwater craft. 
Errors in judgment in identifying such under- 
water craft have led to instances where we have 
sunk our own submarines or allowed enemy 
craft to pass through our territory. Even our 
modern sonar equipment cannot correct for 
similar kinds of errors of judgment.” 

Negative sanctions towards imitative be- 
havior were introduced in Instructions II by 
supplementing this common portion with the 
following: “We are attempting to determine
whether the judgments made by a group of 
people are more reliable, more accurate, than 
the judgments of one person alone. We are, 
in addition, interested in the problem of how 
many people are needed to obtain accurate 
judgments. The problem may be posed as follows:
Suppose that two people individually judge
the same thing. How much difference in 
accuracy is there in their separate judgments 
and are their separate judgments more accurate 
than their combined judgments?” 

In addition, the statement “This will be a 
test of your ability to discriminate tones in that 
we will see how well you do in comparison with 
each other” was included near the end of Instruc- 
tions II, just prior to the practice series. 

Neutral or slightly positive sanctions towards 
imitative behavior were introduced in Instruc- 
tions I by supplementing the common portion 
with the following: “We are attempting to 
determine how much of an actual difference is 
necessary under varying conditions of distrac- 
tion in order for people to recognize these dif- 
ferences. A related problem is the development 
of effective means for improving judgments 
that are difficult to make. One possibility is 
that working in a group may improve one’s 
ability to distinguish differences when conditions 
for making such judgments are poor.” 


Experimental Design 


The three probabilities of reinforcement and 
the two sets of instructions form a 2 X 3 orthog- 
onal design. Twenty-four female and 24 male 
volunteers from the freshman English classes 
at the University of Delaware were randomly 
assigned in like-sexed pairs to the six cells of 
the design with the restriction that each cell 
contain two female pairs and two male pairs. 
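The restricted random assignment described above can be sketched in a few lines. This is only an illustration under stated assumptions: the pair labels (F01 to F12, M01 to M12) and the seed are hypothetical, and the original randomization procedure is not reported in the text.

```python
import random

INSTRUCTIONS = ("I", "II")
PROBABILITIES = (0.20, 0.50, 0.80)


def assign_pairs(seed=None):
    """Assign 12 female and 12 male pairs to the 2 X 3 cells at random,
    with exactly two pairs of each sex in every cell."""
    rng = random.Random(seed)
    cells = [(i, p) for i in INSTRUCTIONS for p in PROBABILITIES]
    assignment = {cell: [] for cell in cells}
    for sex in ("F", "M"):
        # Hypothetical labels: 12 like-sexed pairs = 24 Ss of each sex.
        pairs = [f"{sex}{k:02d}" for k in range(1, 13)]
        rng.shuffle(pairs)
        # Deal the shuffled pairs out two at a time, one pair of slots per cell.
        for idx, cell in enumerate(cells):
            assignment[cell].extend(pairs[2 * idx: 2 * idx + 2])
    return assignment


if __name__ == "__main__":
    for cell, pairs in assign_pairs(seed=1958).items():
        print(cell, pairs)
```

Shuffling within each sex and then dealing two pairs per cell satisfies the restriction by construction, so no rejection step is needed.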

The number of imitative responses (judg- 
ments indicating agreement with the partner) 
per block of 10 trials in the critical series is the 
major variable under observation. A questionnaire
assessing S’s aspiration level and estimates
of his success, of his partner’s success, and of the
amount of agreement with the partner was also 
used to provide exploratory data about S’s 
expectations in the experimental situation. A 
continuous rating scale ranging from 100% to 
0%, or “applies very much” to “doesn’t apply 
at all,” accompanied the following questions 
and statements in the questionnaire:²


1. Before you even began to make any 
judgments, what percentage did you expect to 
judge correctly? 

2. During the period of actually making 
judgments, what percentage did you expect 
to judge correctly? 

3. To what extent were you certain that 
you’d reach the level checked in question 2? 

4. On each trial, I made a very strong effort 
to keep from making a mistake. 

5. On the whole, I made a very strong effort 
to keep from getting below the level checked 
in question 2. 

6. What percentage of your judgments did 
you judge correctly? 

7. What percentage of his judgments did 
your partner judge correctly? 

8. What percentage of your judgments were 
in agreement with those of your partner? 

9. To what extent did you feel frustrated 
during the session? 

10. To what extent did you feel that you 
were competing with your partner? 


RESULTS 
Imitative Responses 


An analysis of variance and 
orthogonal polynomial analyses of 
trend (3) were applied to the number 
of imitative responses per block of 
ten trials in the series of 60 critical 
trials. 

The summary of the analysis of
variance is presented in Table 1.
Results from the Duncan Range test
(1) on the Instructions X Probability
of Reinforcement interaction means
are given in Table 2. The three
probability of reinforcement means
under Instructions I differ significantly
from one another; the largest
amount of imitative responses occurs
when imitation is instrumental to
goal achievement (.80 reinforcement)
and the least amount occurs when
opposition is instrumental (.20
reinforcement). None of the probability
of reinforcement means, however,
differ under Instructions II. In addition,
the two instructions means differ
significantly within the .80 and .20
reinforcement conditions but not within
the .50 reinforcement condition:
There are more imitative responses
under the instructions inducing neutral
or slightly positive sanctions
towards imitation (Instructions I)
than there are under the instructions
inducing negative sanctions towards
imitation (Instructions II) when imitation
is instrumental; there are fewer
imitative responses under Instructions
I than under Instructions II
when opposition is instrumental; and
there are no significant differences
between instructions in the amount of
imitative responses when neither imitation
nor opposition is instrumental.


2 Copies of this questionnaire are available
from the authors upon request.


TABLE 1

ANALYSIS OF VARIANCE OF THE
IMITATIVE RESPONSES

Source                     df       MS        F       P

Instructions (I)            1      1.00
Probability Ratio (PR)      2     54.19     12.04
Trials (T)                  5       .08
Sex (S)                     1       .00
I X PR                      2     43.92      9.76    <.01
I X T                       5      4.61      2.11
I X S                       1       .43
PR X T                     10      5.01      2.30    <.05
PR X S                      2      2.97
T X S                       5      2.31      1.06
I X PR X T                 10      3.62      1.66
I X PR X S                  2      2.36
PR X T X S                 10      1.93
I X T X S                   5      1.56
I X PR X T X S             10      2.84      1.30
Ss within                  36      4.50      2.11    <.01
Ss within X Trials        180      2.18
Total                     287

Note:—The Ss-within cells is used as the error term
for nonrepeated measures; Ss within X Trials is used as
the error term for all others. The significant terms
are then tested against the appropriate significant
higher-order interactions where such interactions
exist.


TABLE 2

DUNCAN RANGE TEST RESULTS ON THE INSTRUCTIONS
BY PROBABILITY OF REINFORCEMENT
IMITATION MEANS*

                  Probability of Reinforcement
Instructions        .20        .50        .80

I                  4.46a      5.44b      7.27c
II                 5.54b      5.58b      5.69b

* Those means marked with a letter in common do
not differ significantly at the .05 level.

The Duncan Range test results on 
the Probability of Reinforcement 
X Trials interaction means in Fig. 1 
indicate that the three curves differ 
from each other, since on at least one 
block of trials the means differ 
significantly. 
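As a check on how the F ratios in Table 1 follow from its mean squares, the short sketch below divides each effect mean square by the error term named in the table note. The dictionary layout and function names are illustrative assumptions; only the numerical values are taken from the table.

```python
# Error mean squares as printed in Table 1.
MS_SS_WITHIN = 4.50            # error term for nonrepeated measures
MS_SS_WITHIN_X_TRIALS = 2.18   # error term for all terms involving Trials

# effect: (mean square, error mean square, F as printed)
TABLE_1 = {
    "PR": (54.19, MS_SS_WITHIN, 12.04),
    "I X PR": (43.92, MS_SS_WITHIN, 9.76),
    "I X T": (4.61, MS_SS_WITHIN_X_TRIALS, 2.11),
    "PR X T": (5.01, MS_SS_WITHIN_X_TRIALS, 2.30),
    "T X S": (2.31, MS_SS_WITHIN_X_TRIALS, 1.06),
    "I X PR X T": (3.62, MS_SS_WITHIN_X_TRIALS, 1.66),
    "I X PR X T X S": (2.84, MS_SS_WITHIN_X_TRIALS, 1.30),
}


def f_ratio(ms_effect, ms_error):
    """F is the effect mean square divided by its error mean square."""
    return ms_effect / ms_error


def recompute(table):
    """Return {effect: (recomputed F, printed F)} for every row."""
    return {name: (f_ratio(ms, err), printed)
            for name, (ms, err, printed) in table.items()}


if __name__ == "__main__":
    for name, (computed, printed) in recompute(TABLE_1).items():
        print(f"{name:16s} computed {computed:6.2f}   printed {printed:6.2f}")
```

Rows without a printed F in the table are omitted here; their ratios fall below 1.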

Orthogonal analyses of trends were
applied to the data from Instructions
I, the data from Instructions II, and
also the combined instructions data.
Only the summaries of the analyses
of the Instructions I data and the
Instructions II data are presented in
Tables 3 and 4. As expected, the
Between Group Means, the Probability
of Reinforcement, and the
Probability of Reinforcement X Instructions
terms are highly significant
in the orthogonal polynomial analysis
of trends in the combined data.
The Between Group Trends term is
significant, as are the linear and
quadratic components of these trends.
There are also significant differences
in trend among the different probabilities
of reinforcement in both the
linear and the quadratic components.
The differences in the trends among
the probabilities of reinforcement
are evident in Fig. 1: When imitation
is instrumental, the number of imitative
responses at first increases over
trials and then drops slightly in
contrast to the immediate drop in
imitative responses when opposition
is instrumental.

In addition, the quadratic component
of the differences in trend among
the Instructions X Probability of
Reinforcement groups is significant.
These differences in trends are evident
in Fig. 2. The differences
among the probability of reinforcement
curves under Instructions I are
striking. When imitation is instrumental,
the number of imitative
responses increases with trials and
then levels off; when opposition is
instrumental, the number of imitative
responses decreases with trials and
then levels off; when neither imitation
nor opposition is instrumental, slightly
more than half the responses on each
block of trials are imitative responses.
Under Instructions II, both the .80
and .20 reinforcement conditions show
an early increase and subsequent
decrease in imitative responses, whereas
in the .50 reinforcement condition,
slightly more than half the responses
on each block of trials are imitative
responses except on the last block of
trials.

[Figure caption, beginning lost in source: “... per block of trials
as a function of probability of reinforcement.”]


TABLE 3

TREND ANALYSIS OF IMITATIVE RESPONSES UNDER INSTRUCTIONS I

Source                            df       MS      Error Term     F        P

A. Over-all trend                 (5)    (1.91)        E         1.16
   1. Linear                       1      2.14         E1        1.40
   2. Quadratic                    1      3.33         E2        1.60
   3. Deviation from 1 and 2       3      1.36         E3
B. Between group means             2     97.84         D        23.86    <.01
C. Between group trends          (10)    (5.24)        E         3.18    <.01
   1. Linear                       2     11.93         E1        7.80    <.01
   2. Quadratic                    2      9.41         E2        4.52    <.05
   3. Deviations from 1 and 2      6      1.62         E3        1.05
D. Between ind. means             21      4.10         E         2.48    <.01
E. Between ind. trends          (105)    (1.65)
   1. Linear                      21      1.53
   2. Quadratic                   21      2.08
   3. Deviations from 1 and 2     63    [illegible]
Total                            143

The Between Group Means, the
Between Group Trends and the
linear and quadratic components of
the Between Group Trends give
significant F’s in the orthogonal
polynomial trend analysis of the
Instructions I data.³ Duncan Range
test results indicate that the three
probability of reinforcement means
differ significantly from one another.
The largest amount of imitative
responses, an average of 7.27 per block
of 10 trials, occurs when imitation is
instrumental; the next largest amount
of imitative responses, an average of
5.44 per block of trials, occurs when
neither imitation nor opposition is
instrumental; and the smallest
amount of imitative responses, an
average of 4.46 per block of
trials, occurs when opposition is
instrumental.


3 Refer to Fig. 2 for the differences in trend
among the probabilities of reinforcement under
Instructions I and under Instructions II.


TABLE 4

TREND ANALYSIS OF IMITATIVE RESPONSES UNDER INSTRUCTIONS II

Source                            df       MS      Error Term     F        P

A. Over-all trend                 (5)    (2.78)        E      [illegible]
   1. Linear                       1      1.80         E1
   2. Quadratic                    1      6.79         E2        2.34
   3. Deviations                   3      1.77         E3
B. Between group means             2       .27
C. Between group trends          (10)    (3.39)        E         1.24
   1. Linear                       2      4.47         E1        1.56
   2. Quadratic                    2     12.08         E2        4.17    <.05
   3. Deviation                    6       .13         E3
D. Between ind. means             21      4.13         E         1.51    <.05
E. Between ind. trends          (105)    (2.74)
   1. Linear                      21      2.87
   2. Quadratic                   21      2.90
   3. Deviations                  63      2.64
Total                            143


Fig. 3. Cumulative proportions of the imitation response and its reinforcing
event in the .60 probability of reinforcement condition.


Although the quadratic component
of the Between Group Trends is
significant in the orthogonal polynomial
trend analysis of the Instructions
II data, the over-all Between Group
Trends term is not.⁴ Nor do the three
probability of reinforcement means 
differ significantly. 
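To make the linear and quadratic components concrete, the sketch below applies the standard orthogonal polynomial coefficients for six equally spaced blocks of trials to a set of block means. The block means in the example are hypothetical values chosen only to show a rise followed by a leveling off; they are not data from this study, and the helper names are assumptions.

```python
# Standard orthogonal polynomial coefficients for six equally spaced points.
LINEAR = (-5, -3, -1, 1, 3, 5)
QUADRATIC = (5, -1, -4, -4, -1, 5)


def contrast(means, coeffs):
    """Weighted sum of the block means for one trend component."""
    return sum(c * m for c, m in zip(coeffs, means))


def contrast_ss(means, coeffs, n_per_mean):
    """Single-df sum of squares for a contrast on means of n scores each."""
    value = contrast(means, coeffs)
    return n_per_mean * value ** 2 / sum(c * c for c in coeffs)


if __name__ == "__main__":
    # Hypothetical block means: an early increase that then levels off.
    means = [5.0, 6.5, 7.5, 7.8, 8.0, 8.0]
    print("linear contrast   :", contrast(means, LINEAR))
    print("quadratic contrast:", contrast(means, QUADRATIC))
    print("linear SS (n = 8) :", contrast_ss(means, LINEAR, 8))
```

A positive linear contrast with a negative quadratic contrast corresponds to a curve that rises and then flattens, which is the shape described for the .80 condition under Instructions I.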


Event-Matching Behavior 


Estes (2) has suggested that learning
theory models could also be used
to describe learning in social situations.
One of the deductions from
his model is the general matching
law: The cumulative proportions of a
given response and the corresponding
reinforcing event tend to equality
as the number of trials approaches
infinity. The present study suggests
that some modifications in the model
may be necessary as Ss’ response
probabilities in certain of the conditions
did not reach the probability
of reinforcement for the response,
but stabilized some distance below it.
The cumulative proportions of the
imitative response and the corresponding
reinforcing event, E1, for the .50
and .80 reinforcement conditions are
plotted in Fig. 3 and 4, respectively;
the cumulative proportions of the
opposition response and the corresponding
reinforcing event, E2, for the
.20 reinforcement condition are plotted
in Fig. 5.


4 See Footnote 3.

Fig. 4. Cumulative proportions of the imitation response and its reinforcing
event in the .84 probability of reinforcement condition.

Fig. 5. Cumulative proportions of the opposition response and its reinforcing
event in the .36 probability of reinforcement condition.

Whenever the “partner’s” choice is
correct, E1 is said to occur, and
E2 is said to occur whenever the
choice is incorrect; thus,
E1 occurs when S agrees with his
“partner” and is called correct (the
white light is flashed on) and also
when S opposes his “partner” and is
called incorrect (the white light is not
flashed on), whereas E2 occurs when S
agrees with his “partner” and is called
incorrect, and also when he opposes
his “partner” and is called correct.
The cumulative proportions are
taken over the entire 75-trial sequence.
Since the “partner’s” choice
is correct on all 15 of the noncritical
trials, the probabilities of reinforcement
for the entire sequence are .84,
.60, and .36. Hence, the cumulative
proportion of E1 stabilizes around .60
and .84 in Fig. 3 and 4, respectively,
while the cumulative proportion of E2
stabilizes about .64 in Fig. 5. The
matching law appears to hold for 
both sets of instructions in the .60 
probability of reinforcement. For the 
.36 and .84 probabilities of reinforce- 
ment, however, the cumulative pro- 
portion of the given response doesn’t 
reach the cumulative proportion of 
its corresponding event but stabilizes 
at about .54 and .74, respectively, 
when neutral or slightly positive 
sanctions for imitation are induced 
(Instructions I) and about .41 and .61, 
respectively, when negative sanctions 
are induced (Instructions II). 
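The arithmetic behind the whole-sequence probabilities and the undermatching reported above can be laid out explicitly. The sketch below assumes, as the text states, 60 critical trials at the nominal probability plus 15 noncritical trials on which the “partner” is always correct; the function names are illustrative, and the asymptotic response proportions are the approximate values quoted in the text.

```python
CRITICAL_TRIALS = 60
NONCRITICAL_TRIALS = 15   # the "partner" is correct on every one of these


def overall_probability(p_critical):
    """Probability of E1 (partner correct) over the whole 75-trial sequence."""
    reinforced = p_critical * CRITICAL_TRIALS + NONCRITICAL_TRIALS
    return reinforced / (CRITICAL_TRIALS + NONCRITICAL_TRIALS)


def undermatching(event_proportion, response_proportion):
    """Distance by which the response proportion falls short of its event."""
    return round(event_proportion - response_proportion, 2)


if __name__ == "__main__":
    for p in (0.80, 0.50, 0.20):
        print(p, "->", round(overall_probability(p), 2))

    # Opposition is the reinforced response in the .20 (.36 over-all)
    # condition, so its event proportion is 1 - .36 = .64.
    e2_opposition = round(1 - overall_probability(0.20), 2)
    e1_imitation = round(overall_probability(0.80), 2)

    # Approximate asymptotes quoted in the text.
    print("Instructions I :", undermatching(e2_opposition, 0.54),
          undermatching(e1_imitation, 0.74))
    print("Instructions II:", undermatching(e2_opposition, 0.41),
          undermatching(e1_imitation, 0.61))
```

On these figures the shortfall is about .10 under Instructions I and about .23 under Instructions II in both instrumental conditions.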


Questionnaire Responses


TABLE 5

INTERCORRELATIONS AMONG THE QUESTIONNAIRE ITEMS

[Table body not recoverable from source; column headings: Probability of
Reinforcement, Questionnaire Items]


The intercorrelations among the
items of the questionnaire are presented
by probability of reinforcement
in Table 5; the pattern of
significant relationships among ques- 
tionnaire items clearly varies with 
probability of reinforcement. When 
neither imitation nor opposition is 
instrumental, the only significant 
correlations are between confidence 
in attaining the level of aspiration 
and estimate of own success and be- 
tween over-all effort exerted and 
reported frustration. When imita- 
tion is instrumental, however, a 
prominent cluster of significant inter- 
correlations exists among both levels 
of aspiration, the estimates of own 
success, of partner’s success, and of 
agreement with the partner and low 
frustration. When opposition is in- 
strumental, the prominent cluster 
includes both levels of aspiration, 
confidence in attaining the level of 
aspiration, and estimate of own 
success.⁵


5 Since the significant results of the numerous 
additional analyses do not form an obvious 
pattern and are not easily interpretable, they 
will be mentioned only briefly. In the .20 
condition, Ss underestimate percentage of 
agreement with a partner (19.36 mean deviation 
of estimated percentage of agreement from 
actual percentage of agreement) significantly 
more than Ss in the .50 condition (3.25 mean 
deviation), but not significantly more than 
those in the .80 condition (10.50 mean devia- 
tion); the mean deviations for the .50 and 
.80 conditions do not differ significantly. Imita- 
tion and overestimation of partner’s performance 
are positively related (r = .51) in the .80
condition, whereas imitation and ongoing level 
of aspiration are positively related (r = .37) 
in the combined .80 and .20 conditions. Con- 
fidence in attaining one’s level of aspiration is 
positively related to overestimation of one’s own 
success (r = .79) in the .20 condition and 
negatively related with underestimation of 
agreement with the partner (r = -.64) in the .80
condition. In both the .80 and .20 conditions 
the ongoing level of aspiration and overestima- 
tion of own success are positively related 
(r = .59), and confidence in attaining the 
level of aspiration is positively related with 
overestimation of partner’s success (r = .53).


DISCUSSION


It was hypothesized that the utiliza- 
tion of a behavior most effective for 
reaching a task goal would depend upon 
the prevailing social sanctions. As pre- 
dicted, expectations of approval or 
disapproval for imitation did affect the 
acquisition of imitative responses. When 
expectations of neutral or slightly posi- 
tive social sanctions for imitation were 
induced, the rate and level of acquisition 
of imitative responses varied significantly 
with the instrumental value of the 
imitative response. When expectations 
of negative social sanctions were induced, 
these differences in rate and level of 
acquisition were depressed. 

It is apparent from the data on the 
opposition responses, however, that the 
induced sets aroused expectations of 
social sanctions, not only for the utiliza- 
tion of imitative responses per se, but 
also for any utilization of the available 
information about another’s behavior. 
In effect, the inductions modified Ss’ 
set towards independence. In the con- 
dition where the utilization of informa- 
tion about others was perceived as 
legitimate (Instructions I) the two in- 
strumental behaviors, imitation and 
opposition, were also socially sanctioned 
behaviors. In the condition where inde- 
pendence was perceived as the social 
norm (Instructions II) the social sanc- 
tions and instrumental values were 
conflicting; Ss were forced to choose 
between conforming to the social norms 
and performing poorly on the task or 
of employing a nonsanctioned behavior 
and performing well. These Ss ap- 
parently were aware of the elements 
forming the conflict situation. They 
were able to estimate the partner's 
performance as well as Ss in the condition
where social norms and instrumental
values were not conflicting,⁶ and
yet they did not imitate or oppose as 
consistently. It is possible that Ss
oscillated in their use of imitation or
opposition when they were faced with
the conflicting sanctions and instrumental
values. From the methods of
analysis employed, however, it was not
possible to distinguish between an alternation
behavior and completely independent
behavior.⁷

It was also suggested that those Ss
who imitate may reduce the conflict,
or the anxiety associated with it, by
denying the existence of the negative
sanctions, underestimating the extent
of imitation, or underestimating their
success, whereas those Ss who do not
imitate may reduce the conflict by either
lowering their levels of aspiration, or
overestimating their success, or underestimating
their partner’s success, or
denying the importance of the immediate
task, or any combination of the above.
Only when imitation was instrumental
did one of the predictions hold; imitation
and overestimation of partner’s performance
were positively related. The
questionnaire data, on the whole, did
not substantiate these hypotheses.

Further analysis of the data was
undertaken to examine the application
of Estes’ learning theory model to learning
in social situations. Strict event-matching
occurred only when neither
imitation nor opposition was instrumental
to success on the task, whereas
the event was undermatched when
imitation or opposition was instrumental.
One possible explanation is that there
exists a generalized resistance to violating
the notion that one can do well on his
own without accepting outside assistance.
The conflict between accepting possible
failure by performing on one’s own and
being assured of success by copying or
opposing another may then be resolved
in such a way that Ss as a group undermatch
the event, the extent of the undermatching
being determined by the
prevailing social sanctions.

Further research is needed to determine
the conditions which affect the
development of sanctions favorable to
an imitative or independence set. Some
of the variables to be considered are
the characteristics of the partner, of the
reward for goal achievement, and of the
task. Studies of the effect of increasing
the reward on the acquisition of imitative
and opposition responses and the
effect of varying the instrumental value
on both the acquisition and extinction
of imitative and opposition responses are
in progress.


6 Instructions did not yield a significant F
ratio in the analyses of variance of estimates of
partner’s performance and in the deviation of
partner’s actual success from S’s estimate of the
partner’s success.

7 An analysis of the number of runs of
imitative responses indicates that the number
of runs of imitative responses increases with
increasing instrumental value of imitation,
regardless of social sanctions.


SUMMARY 


The present study tests the hypothesis that, 
although a given behavior—i.e., imitation or 
opposition—may be the most appropriate means 
for reaching an immediate task goal, the utilization
of such behaviors depends on the prevailing
social sanctions for their use.

Two Ss, seated in separate booths, judged 
whether the second of a pair of identical tones 
in a series of 60 paired tones was higher than 
the first. Each S indicated his judgment by 
pressing the appropriate response key after his 
“partner’s” choice was indicated by a light above
the response keys; if S’s judgment was correct, 
a white light was flashed on. Since each S was 
instructed that his partner would judge first, 
E was able to present a standard randomized 
sequence of “partner’s” choices, as well as a prearranged
sequence of reinforcements for imitation
responses. The two independent variables
were: (a) Probability of reinforcement for 
imitation. The percentages of the “partner’s” 
choices that were considered correct were either 
80%, 50%, or 20%, i.e., the three probabilities 
of occurrence of reinforcement for imitative 
responses were .80, .50, and .20. (b) Instruction-induced
set. One set of instructions was
designed to induce expectations of neutral or 
slightly positive sanctions for imitation, while 
another set was designed to induce expectations 
of negative sanctions. 

As predicted, the extent of increase in the 
rate and level of acquisition of imitation with 
increasing instrumental value was significantly 
greater under positive sanctions towards imita- 
tion than under negative sanctions. It was 
also apparent that the acquisition of opposition 
responses was affected by the instruction-in- 
duced set. This suggests that the instructions 
modified S’s set towards independence and not 
towards the utilization of imitative responses 
per se. 

The questionnaire data did not substantiate
the subsidiary hypotheses on the modes of resolution
of the conflict engendered in the condition
where social sanctions and instrumental values 
were conflicting. 

The hypothesis advanced by Estes that 
“event-matching” behavior occurs in social 
learning situations was substantiated only in 
the .50 probability of reinforcement condition. 
Undermatching of the event occurred in the 
.80 and .20 conditions, the extent of the under- 
matching being determined by the prevailing 
social sanctions. 


REFERENCES 


1. Duncan, D. B. Multiple range and multiple 
F tests. Biometrics, 1955, 11, 1-41. 

2. Estes, W. K. Of models and men. Amer. 
Psychologist, 1957, 12, 609-617. 


VERA T. KANAREFF AND JOHN T. LANZETTA 


3. Grant, D. A. Analysis of variance tests
in the analysis and comparison of curves.
Psychol. Bull., 1956, 53, 141-154.

4. Luchins, A. S. On agreement with another's
judgment. J. abnorm. soc. Psychol.,
1944, 39, 97-111.

5. Miller, N. E., & Dollard, J. Social learning
and imitation. New Haven: Yale
Univer. Press, 1941.

6. Schein, E. H. The effect of reward on
adult imitative behavior. J. abnorm.
soc. Psychol., 1954, 49, 389-395.

7. Schwartz, N. Referred to in R. R. Bush and
F. Mosteller, Stochastic models for learning.
New York: Wiley, 1955. Ch. 12.


(Received for early publication 
June 18, 1958) 





Journal of Experimental Psychology 
Vol. 56, No. 6, 1958

Studies concerned with the behavioral
assessment of the neurophysiological
condition of the eye
have employed, among other measures,
indices of visual sensitivity.
This index is frequently determined
by employing threshold luminances 
for a target that may be exhibited (a) 
continuously, (b) for a brief duration
as in a flash, or (c) intermittently 
as in flicker. Under well controlled 
test conditions, visual threshold deter- 
minations show a high degree of 
reproducibility and many investiga- 
tors have considered that, especially 
among inexperienced Os, the response 
to flicker is more reliable than other 
types of visual tests (9). 

In routine determinations of critical 
flicker frequencies, the SD from the 
mean is usually found to be less than 
2 cps during an experimental session 
(8,13). This stability of critical flicker 
frequency (CFF), under a standard 
condition of luminance, size, and 
retinal position of the test field, has 
led to the assumption that CFF may 
be regarded as a numerical entity 
and that deviations from the “normal”
denote changes in the physiological
state of the eye or the organism
as a whole (10, 11). 

One of the important organismic 
variables known to influence CFF 
variation among individuals is chrono- 
logical age. Simonson, Enzer, and 
Blankstein (14), and Brozek and 


1 This study is one of a series sponsored by 
the Commission on Accidental Trauma, Armed 
Forces Epidemiological Board, Department of 
Defense, and supported in part by funds from 
the Office of the Surgeon General, Department 
of the Army. 


Keys (2) report a definite decrease in 
CFF for Os over 40, but little change 
in CFF was found before this age. 
Misiak (12) studied Os ranging in 
age from 7 to 89 yr. and found that 
average CFF declines rather regularly
from age 20 to 89, but not
without considerable individual varia- 
tion. A more recent study by Cop- 
pinger (4) has established that the 
CFF-age function is linear and nega- 
tive from the age of 20 to 90. Fur- 
thermore, the decreased sensitivity 
among older Os cannot be entirely 
attributed to diminished pupil size 
since Weekers and Roussel (15), using 
pupil dilation with atropine, find 
that there is still a CFF decrement 
with age. 


Problem 


While it has been generally accepted 
that CFF tends to be negatively 
correlated with age, there has been 
little uniformity in previous studies
in the characteristics of the test
field, especially in the light:dark 
ratio (LDR) of the flicker cycle. 
Furthermore, since only a few studies 
have used more than one LDR, 
little systematic information is avail- 
able on the comparative sensitivity 
of the fusion threshold when the 
light-time proportion (LP) of the 
LDR is varied. Simonson, Enzer, 
and Blankstein (14) found an equal 
effectiveness of light-time proportions 
of 64%, 27%, and 17.5% among 
various age groups. Fisher (6) using 
light:dark ratios ranging from 2% 
to 98% reports that elderly Os (76 to 
87 yr.) respond differently from
young Os (21 to 24 yr.) to the
shortening of the dark interval of 
the flicker cycle. 


The present experiment attempts 
to establish more precisely the nature 
of the relationship between foveal 
CFF and chronological age by (a) 
using a number of different light:dark
ratios, and (b) employing a large 
sample of Os covering an age span 
of 76 yr. 


METHOD


Observers.—Critical flicker frequency thresholds
were obtained from 108 male Os ranging in
age from 13 to 89 yr. Among them were 
secondary school and college students, office 
workers and clerks, taxi drivers, and residents 
of old-age homes. All were in good health and 
were nonindigent. The Os who ordinarily wore 
glasses were allowed to wear them during the 
test. The numbers in each age group were as 
follows:

Age Range      Number

13-19            10
20-29            13
30-38            13
40-49            16
50-58            14
61-68            14
70-78            13
80-89            15


Apparatus.—In most studies on intermittent 
light stimulation, a motor-driven sector disc 
has been used for the production of flicker. 
With this type of apparatus the change of the 
LDR necessitated the use of different discs 
which interrupted the test light at a focal point 
in the optical system of the instrument. More 
recently, electronic devices have been used for 
flicker production. In comparison to the rota- 
tion devices, these have the advantages of being 
easier to operate and of eliminating noise and 
vibration, but they may have the disadvantage 
of having very short and fixed flash durations 
which do not change when frequency is altered 
(General Radio Strobotac). 

For the present investigation, a flicker fusion
apparatus² which activates a cold cathode
light modulating tube (Sylvania R1131C) was
used. The light pulses are of a square wave
form, and the apparatus permits the duty cycle
or operation time of the lamp to be varied
from 2% to 98% of the repetition
period. The flash frequency is variable from 1 to
125 flashes per second. The peak luminance
is controlled by varying the amperage, and if
necessary the tube can be illuminated continuously
for use as a standard “steady” light.


2 Designed by W. Bruce Fisher while on
sabbatical leave at the Harvard School of
Public Health from Fresno State College and
constructed to his specifications by the Grayson-Stadler
Company, West Concord, Massachusetts.
Model E6-22.

The glow modulator tube was placed behind 
a piece of ground glass which provided a pre- 
dominantly white test area of even illumination 
over its surface. The test area was circular 
with a diameter of 1 cm., and was in the center 
of a circular surround which had a diameter 
of 28 cm. The surround was illuminated by a 
concealed circular neon daylight white tube 
matched in hue to the test area. Two surround
luminances were employed: 23.6 and .04 ml. 
Perforated shields and a Variac were used to 
change the surround luminance without a color 
change. 

The test area and surround were mounted 
on the rear wall of a rectangular housing. The 
test area served as the fixation point and was 96 
cm. in front of and level with O’s pupils. Thus, 
the test area subtended a visual angle of 36 min. 
of arc, while the surround subtended a visual 
angle of approximately 17°. These angular 
dimensions allowed the image of the test area 
to fall within the fovea. The surround extended 
8.5° into the periphery. 
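The angular dimensions quoted above follow from the physical sizes and the 96-cm. viewing distance. The sketch below uses the usual formula, angle = 2 arctan(size / 2 / distance); the function name is an illustrative assumption.

```python
import math


def visual_angle_deg(size_cm, distance_cm):
    """Visual angle in degrees subtended by an object of a given size
    viewed head-on from a given distance."""
    return math.degrees(2 * math.atan((size_cm / 2) / distance_cm))


if __name__ == "__main__":
    distance = 96.0                                   # cm., test area to O's pupils
    test_area = visual_angle_deg(1.0, distance) * 60  # minutes of arc
    surround = visual_angle_deg(28.0, distance)       # degrees
    print(f"test area: {test_area:.1f} min of arc")
    print(f"surround : {surround:.1f} deg "
          f"({surround / 2:.1f} deg into the periphery)")
```

The computed surround comes out slightly under the rounded figures in the text, which is consistent with the stated "approximately 17°".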

During the threshold determinations, the 
position of O was comfortably but securely 
maintained by means of a modified dentist's 
chair with back, foot, shoulder, and arm rests. 
The positioning of O was accomplished by 
vertical and lateral chair adjustments by E.
When O’s eyes were level with the test area, 
his head was then held in a fixed position by 
means of a chin and head rest. 

Procedure.—Each O was first made familiar 
with the distinction between flicker and fusion 
by changing the frequency from steady to 
intermittent stimulation. When he understood 
the nature of the task, practice trials were 
given until the responses to change from “steady 
to flicker” and “flicker to steady” were consistent.
To insure adaptation to the surround
illumination, O was not allowed to look away 
from the test field during the preliminary trials. 
Binocular CFF’s were then determined by the 
method of limits using discontinuous exposures. 
The duration of each exposure was constant 
at 1.5 sec., and the change of the flicker rate in 
ascending and descending determinations was 
constant at 1 cps. 

In determining each individual threshold,
readings were taken as the continuously increased
frequency led to the report of “steady,”
and also when the frequency was steadily 
decreased until O reported “flicker.” During 
this process the luminance of the test area was 
kept constant while frequency was varied by 
E. As frequency was increased or decreased 
the light duration of the cycle varied propor- 
tionately. Consequently, the light-time frac- 
tion of the flicker cycle remained constant 
regardless of the frequency of intermittence. 
Threshold determinations for each O were 
obtained under 10 light-time proportions: 
2%, 5%, 10%, 25%, 40%, 50%, 75%, 90%, 
95%, and 98%. 
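
Because the light-time fraction is held constant while frequency changes, the duration of each light pulse scales with the period of the cycle. The sketch below illustrates that relation; the function name and the example frequencies are illustrative assumptions, not values from the apparatus description.

```python
def pulse_durations_ms(frequency_cps, light_pct):
    """Return (light_ms, dark_ms) for one flicker cycle.

    frequency_cps: flicker rate in cycles per second
    light_pct: light-time as a percentage of the cycle (e.g., 25 for 25%)
    """
    period_ms = 1000.0 / frequency_cps
    light_ms = period_ms * light_pct / 100.0
    return light_ms, period_ms - light_ms


# Illustrative frequencies only: halving the rate doubles both durations,
# so the light-time fraction of the cycle is unchanged.
light_40, dark_40 = pulse_durations_ms(40.0, 25.0)
light_20, dark_20 = pulse_durations_ms(20.0, 25.0)
fraction_40 = light_40 / (light_40 + dark_40)
fraction_20 = light_20 / (light_20 + dark_20)
```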

In this study the peak current was kept 
constant for all threshold determinations. 
When the tube was illuminated steadily, this 
provided the test area with a mean luminance 
of 21.19 ml., as measured with a Macbeth 
illuminometer from the eye position of O. 

For each LP level, two ascending and two 
descending thresholds were obtained. This 
provided CFF values in cycles per second for 
the frequency level at which flicker disappeared, 
and when flicker reappeared. The four 
determinations were averaged as the mean CFF. 

During the first test session, which took 
approximately 1 hr., CFF’s were determined 
with the 10 LDR’s under two surround 
luminances: 23.6 and .04 ml. At these luminance 
levels, the high surround luminance is brighter 
than any of the uncompensated test area 
luminances, while the .04 ml. surround is dimmer 
than any of the test area luminances. 

A second test session followed one week 
later in which the technique of testing was the 
same but the order of surround luminances was 
reversed. In both sessions the starting order 
for the various LP levels was random. For 
each O, 160 CFF’s were determined. The 
values obtained from 108 Os, averaged for the 
two sessions, constitute the material presented 
in the following tables and figures. 


RESULTS 


TABLE 1 

Mean CFF’s and SD’s for Eight Age Ranges at 10 Light:Dark Ratios When 
the Test Area Appears at the Center of a Surround of 23.6 ml. 

TABLE 2 

Mean CFF’s and SD’s for Eight Age Groups at 10 Light:Dark Ratios When 
the Test Area Appears at the Center of a Surround of .04 ml. 

The means and SD’s of the fusion 
thresholds in cycles per second are 
presented in Tables 1 and 2. Inspection 
of the data for the various 
age groups shows that mean CFF 
decreases as age increases under all 
LDR conditions and under both 
conditions of surround luminance. 
The SD’s, however, are not ordered 
as systematically as the means. When 
the youngest age group is compared 
to the oldest age group, the younger 
Os have a smaller SD at all LDR levels 
and at both surround luminances. 
This same relationship is also mani- 
fested when the 20-29 yr. group is 
compared to the 70-78 yr. group. 
The computation of Bartlett’s test 
for homogeneity of variance, however, 
indicates that the variances among 
the eight age groups are not significantly 
different at any LDR level 
under either surround luminance. All 
of the 20 chi-square values derived 
from these data were smaller than 


the value required for significance at 
the 5% level. 

The relationship between CFF and 
the light-time percentage of the 
flicker cycle for the various age 
groups is presented in Fig. 1. For 
the purpose of clarity, Os were 
divided into four groups, of approxi- 
mately a 20-yr. span. In the upper 
part of the figure are presented the 
observations obtained with a sur- 
round luminance of 23.6 ml., and in 
the lower part are those for a surround 
of .04 ml. From these curves, and 
from the data presented in Tables 
1 and 2, it becomes apparent that 
the change in CFF with age is a 
function of light-time percentage of 
the flicker cycle. The age differences 
are most marked when the proportion 
Fig. 1. Relationship between CFF and the 
light-time percentage of the flicker cycle for 
four age ranges, when surround illumination is 
23.6 ml. (top), and .04 ml. (bottom). 


of light in the LDR is low, and the 
differences become progressively less 
noticeable as the LP increases. Al- 
though the CFF values obtained 
at the higher LP’s lack the regularity 
obtained at the lower LP levels, 
differences are still apparent when the 
extreme age groups are compared. 
Inspection of the data also revealed 
that there is a systematic difference 


TABLE 3 

Percentage of Os in the Various Age Ranges 
Reaching Maximum CFF at the Different 
Light-Time Percentage Levels 
for the Two Surround 
Luminances 

Percentage of Light in the Flicker Cycle 

Age Range | 23.6 ml. Surround: 10% | 25% | 40% | 50% | .04 ml. Surround: 10% | 25% | 40% | 50% 

30-48 | 2 66 
70-89 

among the individuals in the LP 
level at which the maximum threshold 
reading is obtained. For all the 
individual CFF-LDR curves a com- 
parison of the critical frequencies 
at the various LDR’s under both 
conditions of surround luminance 
revealed that CFF increased slightly 
as the LP increased, and then reached 
a maximum between an LP of 10% 
and 50%. It can readily be seen 
from the data presented in Table 3 
that the younger Os reach their maxi- 
mum CFF value when the LP of the 
cycle is 10%, and as age increases, the 
maximum CFF progressively shifts 
to higher LP levels up to 50%. When 
the light comprises more than 50% 
of the cycle, CFF decreases as the 
LP increases for all Os. 

Examination of the CFF-LDR 
function for all 108 Os for the sur- 
round luminance of 23.6 ml. (bottom 
of Table 1) shows that CFF increases 
from an initial value of 38.86 cps 
at an LP of 2% to a maximum CFF 
of 43.01 cps at an LP of 25%. Beyond 
25% light-time, increases in the LP 
to 40%, 50%, 75%, 90%, 95%, and 
98% result in a progressive decline 
of CFF in which the threshold values 
are respectively 99%, 97%, 86%, 
67%, 46%, and 21% of those ob- 
tained at 25% light-time. Compar- 
ing the CFF thresholds obtained with 
a surround luminance of .04 ml. 
(bottom of Table 2), as the LP in- 
creases from 2% to 25%, CFF in- 
creases from 36.51 cps and reaches a 
maximum of 40.67 cps. Further in- 
creases in the percentage of light 
in the flicker cycle to 40%, 50%, 
75%, 90%, 95%, and 98% result in a 
decline in CFF in which the values 
are respectively 98%, 97%, 85%, 
66%, 47%, and 25% of those ob- 
tained at 25% light-time. 

Under the conditions of lowered 
surround luminance, the CFF values 
decrease approximately 2 cps from 
2% to 90% light-time. When the 
LP of the cycle is 95%, the lowered 
surround luminance has only a very 
small effect on CFF, and at 98% 
there is a reversal in which the 
threshold obtained under the sur- 
round luminance of .04 ml. is 1 cps 
higher than that obtained under the 
brighter surround luminance. 
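
The percentages quoted above can be converted back to approximate thresholds in cps, which makes the surround comparison easier to follow. This is a rough sketch only: the maxima and percentages are those given in the text (the first percentage for the .04 ml. surround is read here as 98%), the reconstructed values carry the rounding error of whole-number percentages, and the names are illustrative.

```python
# Maximum mean CFF (cps) at 25% light-time, from the text.
MAX_CFF = {"23.6 ml.": 43.01, ".04 ml.": 40.67}

# Light-time percentages beyond the maximum, and the CFF at each level
# expressed as a percentage of the value at 25% light-time (from the text).
LIGHT_PCT = [40, 50, 75, 90, 95, 98]
PCT_OF_MAX = {
    "23.6 ml.": [99, 97, 86, 67, 46, 21],
    ".04 ml.": [98, 97, 85, 66, 47, 25],
}


def approx_cff(surround):
    """Approximate CFF (cps) at each light-time level for one surround."""
    peak = MAX_CFF[surround]
    return {lp: peak * pct / 100.0 for lp, pct in zip(LIGHT_PCT, PCT_OF_MAX[surround])}


bright = approx_cff("23.6 ml.")
dim = approx_cff(".04 ml.")

# Positive values mean the brighter surround gave the higher threshold.
difference = {lp: bright[lp] - dim[lp] for lp in LIGHT_PCT}
```

On these reconstructed values the difference is about 2 cps at 90% light-time, shrinks at 95%, and changes sign at 98%, in line with the reversal described above.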

The CFF-age decrement for each 
LP level was represented by the best 
fitting straight line. This was de- 
termined by the method of least 
squares for the eight mean CFF 
values obtained with the various age 
groups at each LP level. The rela- 
tionship between chronological age 
and foveal CFF is linear and negative 
for all the LP levels as is shown in 
Fig. 2. The data obtained when 
the surround luminance was 23.6 ml. 
are on the left, while those obtained 
when the surround luminance was .04 
ml. are on the right. The curves 
for the LP’s of 2%, 5%, and 10% 
have been shifted upward on the 
coordinate grid, otherwise they would 
overlap those for the longer LP’s. 
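
The least-squares fit of mean CFF on age can be sketched as follows. The age midpoints and CFF values in the example are hypothetical, generated from an exact line purely to show the computation; they are not the values of Tables 1 and 2.

```python
def least_squares_line(xs, ys):
    """Fit y = m*x + k by ordinary least squares; return (m, k)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    m = sxy / sxx
    k = mean_y - m * mean_x
    return m, k


# Hypothetical midpoints for eight age groups, and hypothetical mean CFF's
# lying on a line with slope -0.15 cps per year and intercept 45 cps.
ages = [16, 25, 35, 45, 55, 65, 75, 85]
mean_cffs = [45.0 - 0.15 * age for age in ages]

m, k = least_squares_line(ages, mean_cffs)
```

With real group means the points would scatter about the line, and m would give the CFF-age decrement in cps per year for that LP level.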

To compare the extent of the CFF- 
age decrement for each LP level the 
value of the slope of the decline of 
CFF with increasing age was calcu- 
lated by the straight line equation 
Y = mx + k, where Y is the fusion 
threshold, m is the value of the slope, 
x is the age of O, and k the Y-intercept. 
With a surround luminance of 23.6 
ml., for the LP’s of 2%, 5%, and 10%, 
the lines are essentially parallel and 
the slopes are steep with m values 
-.1636, -.1543, and -.1543, respectively. 
As the LP beyond 10% 
increases to 90%, the slopes become 
progressively less steep with m values 
of -.1229, -.0897, -.0786, -.0426, 
and -.0301 corresponding to the LP 
levels from 25% to 90%. At the LP 
levels of 95% and 98%, the progressive 
decline of the slopes is 
reversed, with the magnitude of m increasing to 
-.0385 and -.0508. 

Fig. 2. Relationship between CFF and age at the various light-time percentages. 
Surround luminance 23.6 ml. left, .04 ml. right. 

Fig. 3. Relationship between slopes of lines 
shown in Fig. 2 and log light-time/dark-time. 

This same relationship also holds 
for the slopes for the various LDR’s 
under the lower surround luminance. 
The decreasing slopes are -.1581, 
-.1570, -.1476, -.1116, -.0779, 
-.0601, -.0353, and -.0317, corresponding 
to the LP’s from 2% to 
90%, respectively. When the light 
comprises more than 90% of the 
cycle, there is a reversal in the rate 
of decay of the slopes with m values 
of -.0400 and -.0597 corresponding 
to the LP’s of 95% and 98%. For 
both surround luminances the lines 
in Fig. 2 from 10% to 98% appear in 
descending order, showing a regular 
decline of CFF values. Only for 
the LP’s of 2% and 5% is the order 
reversed, yielding lower CFF values 
than those found for 10% light-time. 

The slopes of the straight lines 
expressing the CFF-age relationship 
shown in Fig. 2 are presented in Fig. 
3 as functions of the logarithms of 
light-time/dark-time. The curve for 
the surround luminance of 23.6 ml. 
is at a slightly higher level than the 
curve for the surround of .04 ml. 
The curves are in general parallel, 
with the exception of the extreme 
LDR’s. In each curve three seg- 
ments can be distinguished. The 
middle segment is almost straight and 
has a steep slope indicating an inverse 
relationship between the slopes of the 
lines in Fig. 2 and the proportion of 
light in the flicker cycle. The por- 
tions of Fig. 3 which deviate from 
the straight line are on the upper left 
(2% and 5% light-time), and at the 
lower right (90%, 95%, and 98% 
light-time). The slopes for 2% and 
5% light-time are not much higher 
than for 10% light-time, and for this 
reason the points drop off to the left. 
With 95% and 98% light-time the 
slopes do not decrease in the same way 
the slopes corresponding to those be- 
tween 10% and 90% light-time de- 
crease. Beyond an LP of 90%, fur- 
ther increases in the light-time resulted 
in an increase of the CFF-age slope. 
This was in contrast to the relation- 
ship observed from 2% to 90%. 
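
The abscissa of Fig. 3 can be reproduced from the light-time percentages alone. The sketch below pairs each reported slope with log light-time/dark-time; the slope values are those quoted in the text, while the use of base-10 logarithms and the helper names are assumptions made for illustration.

```python
import math

LIGHT_PCT = [2, 5, 10, 25, 40, 50, 75, 90, 95, 98]

# CFF-age slopes (cps per year) as quoted in the text for each surround.
SLOPES_23_6 = [-0.1636, -0.1543, -0.1543, -0.1229, -0.0897,
               -0.0786, -0.0426, -0.0301, -0.0385, -0.0508]
SLOPES_04 = [-0.1581, -0.1570, -0.1476, -0.1116, -0.0779,
             -0.0601, -0.0353, -0.0317, -0.0400, -0.0597]


def log_light_dark(light_pct):
    """log10 of light-time/dark-time for a given light-time percentage."""
    return math.log10(light_pct / (100.0 - light_pct))


def slope_points(slopes):
    """(light %, log light/dark, slope) triples, i.e., points of a Fig. 3 curve."""
    return [(lp, log_light_dark(lp), m) for lp, m in zip(LIGHT_PCT, slopes)]


def flattest_light_pct(slopes):
    """Light-time percentage at which the CFF-age line is least steep."""
    return min(zip(LIGHT_PCT, slopes), key=lambda pair: abs(pair[1]))[0]
```

For both sets of quoted slopes the least steep line falls at 90% light-time, which is the point beyond which the text reports the reversal.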


DISCUSSION 


Among the conditions known to in- 
fluence CFF is the relative light propor- 
tion of the flicker cycle, but the CFF- 
LDR function has not been well defined. 
The only consistent relationship ob- 
served by the majority of the investiga- 
tors is that, beyond an LP of 50%, CFF 
decreases as light-time increases. Dis- 
crepancies in the experimental studies 
appear primarily in (a) the relationship 
between CFF and LDR with light pulses 
shorter than 50% of the cycle, (b) the 
specific rate of decline of CFF beyond 
an LP of 50%, and (c) the particular 
LDR where CFF reaches a maximum. 
The CFF-LDR function obtained in 
this study is not intended to resolve 
the diversified results of the previous 
studies (1,3,7), particularly since Landis 
(9) has demonstrated that the interaction 
of various physical properties of the 
stimulus, such as intensity and area, 
results not only in a change of the posi- 
tion of the curves but in the direction 
of their slopes. The significant aspect 
of the results obtained with the flicker 
fusion apparatus in this experiment, 
other than the form of the function, 
is the indication that the chronological 
age of O is another interacting variable 
in the determination of the CFF-LDR 
relationship. The brightness of the 
surround, however, does not appear to 
add to the interacting factors since the 
shapes and slopes of the two curves were 
the same. The effect of the two sur- 
round luminances resulted only in a 
shift in the position of the curves on the 
frequency axis. 

The analysis of the CFF-age curves as 
a function of LDR demonstrated that 
the variation of the light-time of the 
flicker cycle produced changes in O’s 
fusion threshold. Of greater significance, 
however, is the fact that the sensitivity 
of CFF as an indicator of the visual 
aging process was enhanced under the 
conditions of stimulation provided by 
the lower light-times. The total drop 
in CFF as a function of age is largest 
and about equal for the light-time per- 
centages of 2%, 5%, and 10% for both 
conditions of surround luminance. Be- 
tween 10% and 75% light-times the 
decrement in foveal CFF is inversely 
proportional to the log light-time/dark- 
time. Further increases in the light- 
time to 95% and 98% resulted in an 
inflection of the Slope-LDR curve indi- 
cating that the slope of the line express- 
ing the CFF-age relationship increased 
slightly. 

It should be kept in mind that this 
experiment was conducted using the 
uncompensated luminance method. The 
increase in luminance accompanying the 
increasing increments in the light percentage 
of the cycle makes it difficult 
to ascribe clearly the effects to either 
LDR or luminance. The results ob- 
tained by Coppinger (4) with an LDR of 
1:1 demonstrate that the rate of decay of 
the CFF-age slope increases when the 
level of stimulus brightness is increased. 
This is in contrast to the results of this 
study in which the slope decreased when 
the light-time and, in consequence, the 
level of stimulus brightness increased. 
The range of test area brightness used by 
Coppinger (.09 to 1.74 ml.), however, is 
too limited to provide evidence for the 
proposition that the various differences 
among the age groups are clearly not attributable 
to the changing luminance levels. 
It is also possible that there may be an in- 
teraction between luminance and LDR. 
Thus, the lower levels of test area 
luminance, together with the lower 
LP levels, may have produced an effect 
that any increase in luminance 
could have abolished. Clearly, 
the relative contribution of each of these 
variables should be determined. 

In tests of sensitivity of an O, his 
verbal or motor response to a visual 
stimulus is obviously the terminal reac- 
tion to a long chain of events. Thus, 
if response thresholds vary with age, 
it can be safely assumed that the effec- 
tiveness of the stimulus at any or several 
points from the receptor to the effector 
has been altered and may involve condi- 
tions which are ophthalmological, cardio- 
vascular, and/or neural in nature. With- 
out attempting to speculate in terms of 
the formulation of any specific causative 
factor, Coppinger maintains that “many 
of the structural and functional changes 
associated with aging are qualitatively 
similar to the diseased and artificially 
induced conditions which lower CFF 
in younger persons” (4). Therefore, 
in line with this reasoning, and in 
view of the differential effect of LDR 
as noted in this study, it seems reasonable 
to hypothesize 
that the effect may be manifested in 
situations other than the aging process. 
A review of the literature reveals that 
possible evidence to support this proposition 
is provided by the findings of 
Davis (5). In her investigation, fusion 
thresholds were determined using both 
an episcotister (50:50 LDR), and a 
Strobotac (1:99.9 LDR) at three levels 
of stimulus brightness: .195, .75, and 
3.0 ft.-c. The thresholds provided by 
the Strobotac with the shorter LP proved 
to be more sensitive indicators to the 
physiologic change induced by chemical 
agents. This was true at all test area 
luminance levels. 

The literature is replete with state- 
ments concerning the effectiveness or 
ineffectiveness of CFF as an indicator 
measure, but very few investigators 
have been concerned with the determina- 
tion of the stimulus conditions which 
enhance the use of CFF as an indicator 
response in the assessment of the neuro- 
physiological condition of O. Aside 
from purposes of standardization, there 
is no a priori rationale or any experi- 
mental findings which suggest the con- 
tinued use of the conventional 50:50 
LDR under one level of luminance in 
flicker research. This observation has 
also been made by Winchell and Simon- 
son who state that “the importance of 
the standardization of LDR would de- 
pend on the magnitude of changes 
produced by its variation, and on the 
sensitivity of the CFF at different LDR’s 
in response to physiological stress” (16, 
p. 188). 


SUMMARY 


Binocular critical flicker frequencies (CFF’s) 
were determined for 108 Os ranging in age from 
13 to 89 yr. The light pulses were in the form of 
square waves activating a glow modulator tube. 
The ratio of light-time/dark-time in the flicker 
cycle was varied systematically from 2/98 to 
98/2. The uncompensated luminance pro- 
cedure was used. The test field consisted of a 
circular test area of .6° subtense shown at the 
center of a circular 17° surround. Tests were 
made at surround luminances of 23.6 and .04 ml. 

The major results were: (a) The relationship 
between CFF and age is linear and negative 
for all LDR levels under both conditions of surround 
luminance. (b) The decrement of CFF 
with age is a function of the light-time percentage 
of the flicker cycle. The differentiation 
between age groups is more pronounced at the 
lower light-time percentages. (c) The light- 
time percentage at which the maximum CFF 
occurs is a function of the chronological age of 
O. (d) The change of the surround luminance 
resulted in a shift of the position of the CFF- 
LDR curves, but the shape and slope of the 
curves were not altered. 

On the basis of the results it was suggested 
that the use of lower levels of light-time in the 
flicker cycle, rather than the conventional 
50:50 LDR, would enhance the sensitivity 
of CFF as an indicator measure. 